What lies beneath


A focus on specific biological targets rather than constellations of symptoms heralds a more 
scientific approach to the treatment of neuropsychiatric disorders. 


It is difficult to decide which is the more dispiriting statement: that
one person in four suffers from a mental-health problem at some
point in their lives, or that the sorry state of care for the millions of
people affected is often discussed only in the immediate aftermath of
the actions of a single mentally ill individual. One can quibble about
statistics and diagnostic standards, but the bottom line is that neuro-
psychiatric disorders account for one of the greatest burdens of disease
in the developed world, yet patients are not receiving the help they need.

Part of the problem is that, for many people, the available therapies
simply do not work, and that situation is unlikely to improve any time
soon. By the early 1990s, the pharmaceutical industry had discovered
— mostly through luck — a handful of drug classes that today account
for most mental-health prescriptions. Then the pipeline ran dry. On
close inspection, it was far from clear how the available drugs worked.
Our understanding of mental-health disorders, the firms realized, is
insufficient to inform drug development.

With industry pulling out, the burden of developing therapies for
mental disorders is increasingly falling on academic researchers. And
with its strapped budget, it comes as no surprise that the US National
Institute of Mental Health (NIMH) in Bethesda, Maryland, wants to
make quick and decisive changes to the way its money is spent.

For several years, the NIMH has been trying to forge a differ-
ent approach, and late last month institute director Thomas Insel
announced that the agency will no longer fund clinical trials that do
not attempt to determine a drug or psychotherapy’s mechanism of
action (see page 288). Without understanding how the brain works,
he has long maintained, we cannot hope to know how a therapy works.

According to the NIMH’s new grant framework, clinical trials
must be designed to yield scientific knowledge as well as to determine
whether a therapy works. This will allow researchers to decide quickly
whether to proceed with a trial, modify it or end it before too much
money is spent on recruiting and testing patients. To receive funding,
scientists will have to show how they intend to test whether a therapy
targets a particular brain circuit, for instance, or what mechanism a
therapy uses to alter a person's behaviour.

The move will certainly ruffle feathers. Insel notes that more than
half of the trials that the NIMH currently supports would not receive
funding under the new requirements, at least not without modifications.
For example, a trial that focuses on changes in attention span as a means
of testing a behavioural-intervention therapy in children with a broadly
defined disorder such as attention-deficit hyperactivity disorder would
not be funded unless researchers could present a controlled way to study
how the therapy takes effect. The NIMH would rather see trials that aim
to recruit people with a common trait — say, hallucinations — regard-
less of their specific psychiatric diagnosis, treat them with a drug that
acts on a specific brain receptor, and measure changes in brain activity.

The change applies to trials of drugs and behavioural therapies
alike: the key is that they are set up in such a way that a single variable
can be changed and a single effect seen. An example would be deep
brain stimulation (DBS), which has proved an effective treatment for 
disorders ranging from Parkinson's disease to depression, but whose 
underlying mechanism is largely unknown. When a major trial of DBS 
for depression failed last year, scientists were left not knowing whether 
they had sited the devices wrongly or recruited the wrong group of 
patients. It is becoming increasingly apparent that the optimal place- 
ment of and frequency used by such devices to stimulate neurons may 
differ from person to person (see page 290). 


Critics will argue that the NIMH has exchanged a difficult problem —
treating mental illness — for an even more challenging one, understanding
the brain. But the institute’s new direction on trials may also aid an
effort to free research on mental-health disorders from the limits of
existing diagnostic categories — an approach for which Nature has
previously expressed support (see Nature 496, 397-398; 2013).

A project called the Research Domain Criteria, which the NIMH
is developing, places patients on a matrix on the basis of both their
symptoms and biomarkers such as brain activity. According to the
NIMH, more and more grant applicants are using these criteria,
although no trials have yet been completed. If this work throws up
new targets for possible intervention, industry may once again see
psychiatric drugs as a market worth investing in. It is a gamble — and
a variety of approaches are still needed — but the potential pay-off
is worth it.


How to get ahead 


The success of the $1,000 genome programme 
offers lessons for fostering innovation. 


Genome technologists are on the verge of fulfilling a goal that
was once considered so far off as to be almost fictional: the
sequencing of human genomes for US$1,000 apiece. Eleven
years ago, when genome sequencing cost tens of millions of dollars,
scientists from the US National Human Genome Research Institute
(NHGRI) in Bethesda, Maryland, said that, should it be met, the $1,000
goal “would revolutionize biomedical research and clinical practice”.
The price of sequencing has indeed dropped — and this develop-
ment is changing science and beginning to change medicine. In
retrospect, it may seem that it was inevitable that the cost would come
down from what it was at the close of the Human Genome Project
in 2003. Entrepreneurs and scientists were already working on
technologies to replace Sanger sequencing, the workhorse of the genome
project. But these entrepreneurs and other genome technologists give 
at least some of the credit for the precipitous price drop to the NHGRI. 
In 2003 it launched the $230-million Advanced Sequencing Technol- 
ogy programme, to support research to improve genome technology 
in academia and industry (see page 294). The programme will give out 
its final grants this year, but it has provided some key lessons on how 
to set up similar sponsored innovation efforts. 

Set a clear goal. The NHGRI programme — often referred to as 
the $1,000 genome programme — shares this feature with other suc- 
cessful technological endeavours, including the British government’s 
eighteenth-century Longitude Prize for a practical method to deter- 
mine a ship’s position; NASA’s Apollo Moon shot in the 1960s; and 
even the US Defense Advanced Research Projects Agency's contest 
to sponsor the creation of self-driving cars, first held in 2004. Stating 
a tangible goal galvanizes the field, harnessing researchers’ collective 
brainpower to achieve a common aim. 

Set the bar high, but not too high. The Archon Genomics X Prize 
was cancelled last year after only two teams signed up to take on its 
challenge: the sequencing of 100 complete human genomes in 30 days 
at unparalleled accuracy and low cost. That test was widely seen as too 
difficult, dissuading researchers from trying. The NHGRI, by contrast, 
seems to have set a more reasonable goal; before setting $1,000 as its 
benchmark, it undertook a broad consultation of scientists in the field. 

Spur competition. The 97 groups awarded grants by the NHGRI 
programme so far are made up of academics and companies working
on a diverse range of concepts. Recipients have included a who's
who of the sequencing industry past and present, as well as a huge 
array of people working on speculative ideas. Each grant — rang- 
ing from hundreds of thousands of dollars to a few million — is 
small compared with the amount of funding needed to take an 
idea all the way to market. But the spreading of money between so
many competitors keeps the field lively, and keeps market leaders
on their toes. 

Foster cooperation. The programme convenes all grant recipients 
every year for a meeting, where all — including those from industry 
— are expected to describe their progress. Jeffery Schloss, the amiable 
head of the programme, deserves much of the credit for coaxing grant 
recipients to share information, and for promoting an atmosphere of 
collegiality that has moved the field forward. 


Seed a broad range of ideas. The vast majority of the programme’s
grants — 75% so far — have gone to academics rather than companies.
This has nurtured ideas that are too speculative for industry, ensuring
that when the market weeds out less successful companies or throws
up a monopoly, there is a nursery of new concepts waiting to keep the
field from stagnating.

Be flexible. The NHGRI deserves credit for allowing Schloss to 
operate the programme differently from most of its schemes. For 
instance, reviewers were allowed to award small chunks of money to 
parts of an application that seemed promising, rather than cutting off 
funding for every project that failed to meet a milestone. This allowed 
the programme to provide more grants than would otherwise have 
been possible. 

Of course, just incorporating these six elements into a programme 
does not guarantee success. Entrepreneurs and academic scientists 
have contributed ideas and drive to genomics advances, and ven- 
ture capitalists have contributed money. And although the $1,000 
goal is within striking distance, it has not yet enabled the depth of 
understanding needed to make full medical or biological use of the 
knowledge derived from ever more genomes. Attacking that problem 
is the next challenge of genomics. But in part because of the $1,000 
genome programme, biologists are now in a position to address it.


Past wisdom 


The recent Nature PastCast series is 
instructive as well as entertaining. 


Decades ago, in August 1975, a short paper on antibodies was
published in Nature. The findings spawned a multibillion-dollar
drug industry: the monoclonal antibody drugs Herceptin, used to
treat breast cancer, and Humira, used to treat rheumatoid arthritis, were
together worth around US$15.4 billion to the pharmaceutical industry
in 2012. But the revolution was nearly stillborn: the published technique
to make monoclonal antibodies could not be reproduced. The crisis
was such, says historian Lara Marks at King’s College London, that the
authors considered withdrawing their article from Nature.

It was years before the investigations paid off and the huge medical
potential of monoclonal antibodies started to crystallize. “Nowadays
it seems prophetic,” says Greg Winter, who made the first monoclonal
antibody for use in humans. “But at that point I don’t think anyone had
realized the importance.”

If those who do not learn from the past are truly doomed to repeat
the same mistakes, then Nature has done its bit over the past few
months to help them avoid that fate.

There are many lessons from history in our archive, alongside
nuggets of scientific insight and experience. Some of these — and con-
temporary reflections on them from experts such as Marks — have
been presented in the Nature PastCast, a 12-part audio series on the
history of science that draws to a close this week. Some stories featured
in the series are humbling; others simply entertaining. All are relevant.


274 | NATURE | VOL 507 | 20 MARCH 2014 


Are you navigating a tricky international collaboration? So was 
astronomer Arthur Eddington, who tried to mount an expedition to 
view a solar eclipse as the First World War was drawing to a close. He 
was an Englishman — a pacifist Quaker to boot — trying to find evi- 
dence to support a German-born physicist’s theory (Albert Einstein's 
general theory of relativity). Or maybe your desk is piled high with a 
backlog of data needing analysis? In the mid-1980s, climate scientist 
Jonathan Shanklin had to sift through reams of the stuff before the 
evidence for the ozone hole popped out at him. 

Or are you having trouble avoiding jargon in your latest write-up? In 
a November 1919 discussion of relativity, the physicist J. J. Thomson is
reported to have ‘regretted the very complicated form in which Einstein
expressed his theory, and hoped it might be possible to put it into a form
in which it would be more generally comprehensible and useful’.

Some of the best stories are those that reflect a simple commitment 
to doing science. In November 1924, palaeontologist Raymond Dart 
was getting ready to appear as best man at his friend’s wedding when a 
box of fossils arrived at his home in Johannesburg. Having an inkling
of its content, he couldn't resist opening the package. In his diary, Dart
recounts how the groom himself had to drag him away. “My God, Ray, 
you've got to finish dressing!” 

Wilhelm Röntgen showed similar mettle. According to Otto
Glasser’s 1934 biography, in the days after he first discovered X-rays,
Röntgen ate and slept in his lab “in order to avoid the distracting
influence of daily trivialities, and to be able to continue his experi-
ments immediately in case of a sudden inspiration”. Once Röntgen
had published his X-ray experiments, a journalist asked him what he
thought of his new findings. Röntgen gave a reply that every researcher
should consider sticking on their fridge. “I did not think; I investigated.”

Wanted: Fraud-buster with political antennae

A mild-mannered man’s fiery resignation leaves a troubling vacancy at the
world’s largest office for investigating scientific fraud, says Colin Macilwain.

When David Wright leaves the Office of Research Integrity (ORI)
at the US Public Health Service later this month,
everyone will know just why he is walking out of the door.
Wright's resignation letter to the assistant secretary of health, leaked
last week, is unusually direct. “I’m offended as an American taxpayer
that the federal bureaucracy — at least the part I’ve labored in — is so
profoundly dysfunctional,” he tells his former boss, in a note decorated
with choice cuts of bureaucratic hubris. In one example, Wright says,
he sought an evaluation of the support services available to the ORI,
only to be told that “that had been tried a few years ago and the results
were so negative that no further evaluations have been conducted”.

Wright, a science historian and former research-integrity officer
at Michigan State University in East Lansing, signs off with a cheery
promise to publish a version of the daily log he
kept at the ORI, “to share my experience and
observations with my colleagues in government
and with members of the regulated research com-
munity”. That would be you, dear reader.

Anyone who has dealt with Wright
professionally will be taken aback to see the guy
nailing his colours to the mast in this way. He just
isn’t the rocking-the-boat type: a more courteous
and polite official it would be difficult to meet.
There is a profound feeling in circles interested
in research misconduct that he is one of the
good guys.

“A lot of us are wondering where we go from
here,” says Mark Frankel, head of the Scientific
Responsibility, Human Rights and Law Program
at the American Association for the Advance-
ment of Science in Washington DC. As well one might, when the
directorship of the world’s largest and best-established research-
misconduct office — which lacked a permanent director for two years
before Wright’s appointment in 2012 — has just imploded.

The ORI was established 22 years ago in the wake of the David
Baltimore case, in which allegations of fraud (later dismissed)
rocked the laboratory of one of America’s most eminent biologists.
The agency’s 25 staff are supposed to educate tens of thousands of
researchers on proper research conduct, as well as overseeing inves-
tigations into misconduct by researchers funded by the US National
Institutes of Health (NIH), the world’s largest research agency. But it
was built to deal with the handful of cases that it was assumed would
arise. According to research published last December by Daniele
Fanelli at the University of Edinburgh, UK (D. Fanelli PLoS Med. 10,
e1001563; 2013), the proportion of allegations
the ORI receives that are investigated and closed
has halved over 20 years. Wright’s resignation
letter blows wide open long-standing doubts
about its capacity to deal with a caseload that,
the available evidence suggests, should have expanded with the growth
of the NIH itself.

The administration of US President Barack Obama needs to get a 
grip on this before an explosive high-profile case — such as that of 
Andrew Wakefield and MMR vaccines in the United Kingdom — turns 
up and the ORI can’t cope. If the administration doesn’t do this, Congress
just might. Senator Chuck Grassley (Republican, Iowa) demanded
in February that the ORI explain its lenient treatment of Dong-Pyou 
Han, a physician at Iowa State University in Ames, who was banned 
from seeking NIH funding for three years after falsifying data in AIDS 
vaccine trials that cost the agency US$19 million. 

It is not known how much Grassley’s strident demand for answers 
contributed to Wright’s departure. But attention from one of the most 
feared and respected voices on Capitol Hill can 
only intensify the political hot-house atmosphere 
that his resignation letter blames for the ORI’s
troubles. 

These troubles go back a long way. The ORI 
sits under the assistant health secretary, instead 
of being properly independent like the inspectors
general who keep an eye (imperfectly) on scientific
fraud at the National Science Foundation
(NSF) and other agencies. Its enforcement remit
(of fabrication, falsification and plagiarism, or
FFP) is too narrow, and its budget too small.

Canada has already shown the way. In 2011, it 
set up the Tri-Agency Framework: Responsible 
Conduct of Research. It has a relatively large staff 
of eight, for a research system one-tenth the size 
of that of the United States, a remit that reaches far 
beyond FFP, and it is led by Susan Zimmerman, a tough lawyer unlikely
to put up with dodgy academics whimpering about their “creativity”. 

Wright's resignation comes as a panel chaired by Robert Nerem, a 
bioengineer at the Georgia Institute of Technology in Atlanta, is in the 
final stages of the first National Academy of Sciences study on research 
integrity in 20 years. The report may recommend beefing up the ORI, 
broadening its remit beyond FFP and aligning its approach with that of 
the NSF and other agencies. But with the ORI leaderless, the academy 
report may be even more prone than usual to gathering dust. 

I have been an optimist on misconduct: some 18 months ago, I wrote
that the global community was starting to get a handle on it. Wright's 
appointment was progress, and his departure is a setback. His position 
must be filled promptly by someone respected by both ethicists and 
health researchers. They will also need the skills to build bridges with 
Congress: a peripatetic master, perhaps, but one who can prevent the 
ORI from getting kicked around.


Colin Macilwain writes on science policy from Edinburgh, UK. 
e-mail: cfmworldview@gmail.com 

Selections from the
scientific literature


Orange light boosts
brain power


The colour of the light you are
exposed to could affect how
well your brain functions.
Gilles Vandewalle at the
University of Liége, Belgium,
Howard Cooper at the French
National Institute of Health
and Medical Research in Bron
and their colleagues exposed
16 volunteers to blue, green or
orange light for 10 minutes,
then kept them in darkness for
70 minutes. The researchers
recorded the volunteers’ brain
activity while the participants
performed various cognitive
tests under green light. People
exposed to orange light had
more activity in the prefrontal
cortex, which is involved in
higher cognitive function, than
those exposed to blue light.
The findings suggest that
a light-sensitive protein in
the retina called melanopsin,
which is primed by orange-red
light and is not involved in
vision, can influence cognition.
Proc. Natl Acad. Sci. USA
http://doi.org/rwf (2014)


Badgers roam
many miles


Badgers could be travelling
much greater distances
than previously supposed,
a finding that might affect
how the animals are culled
or vaccinated to prevent the
spread of bovine tuberculosis.
Andrew Byrne at University
College Dublin and his
colleagues marked 963
European badgers (Meles meles;
pictured) between 2008 and
2012 and later recaptured them
to determine their movements
across 755 square kilometres
of County Kilkenny in Ireland.
Roughly half of the recaptures
took place at the same burrows
where an individual was
originally captured. Of the
remaining animals, 43%
travelled less than 1 km. But
some ventured relatively long
distances: 5% of movements
were over 7.3 km and one
animal travelled over 22 km.
The authors suggest that
officials seeking to confine
disease to specific areas
should extend buffer zones by
at least 7.3 km.

J. Anim. Ecol. http://doi.org/rwh
(2014)


Nanotubes rev up photosynthesis


Carbon nanotubes that infiltrate leaves can boost
photosynthetic activity and even turn plants into
chemical sensors.

Michael Strano and his colleagues at the
Massachusetts Institute of Technology in
Cambridge used near-infrared microscopes
to track single-walled carbon nanotubes in
the leaves of Arabidopsis plants (pictured)
and in extracted chloroplasts, the
photosynthesizing organelle in plants. They
found that the nanotubes integrate themselves
into the chloroplasts’ outer envelope. The
semiconducting carbon nanotubes tripled
photosynthetic activity in extracted chloroplasts
compared with those without nanoparticles, by
enhancing electron transport.

In addition, the authors showed that leaves
containing fluorescent carbon nanotubes
designed to stop glowing in the presence of the
pollutant nitric oxide also stopped fluorescing
when exposed to the pollutant. Carbon
nanotubes could allow plants to detect other
chemicals such as pesticides, the researchers say.
Nature Mater. http://doi.org/rxc (2014)


Killing dingoes
has side effects


Efforts to control populations
of a top predator in Australia
have had unintended ecological
effects — decreased vegetation
and fewer small mammals.

In southeastern Australia,
government officials have
been poisoning dingoes (Canis
lupus dingo) to reduce the
predator's impact on livestock.
Mike Letnic at the University
of New South Wales in Sydney
and his team compared seven
sites in the eucalyptus forests
of southeastern Australia, in
which dingoes are controlled,
with the same number of
ecologically similar sites where
the animals are left alone.
Areas where dingoes are killed
contained more of the animal's
prey, such as kangaroos, but
less understory vegetation and
fewer small mammals.

As the prey populations grow
in areas with dingo control,
these animals consume more
vegetation, reducing the cover
under which small mammals
can hide from their predators.
Proc. R. Soc. B 281, 20133094
(2014)

How proteins find
their DNA target 


Researchers have captured 
images of individual proteins 
searching for their DNA- 
binding sites, and have 
quantified parts of this process. 
Zhe Liu and his team at 
the Howard Hughes Medical 
Institute in Ashburn, Virginia, 
looked at transcription factors 
— proteins that bind to specific 
genes to regulate their activity 
— in individual, living mouse 
embryonic stem cells. The 
researchers used microscopes 
to track single molecules of 
Sox2 and Oct4, key stem-cell 
gene regulators. They found 
that the two molecules use 
trial and error to seek out their 
binding sites, by colliding 
with DNA, diffusing away 
and colliding again roughly 
90 times before finding their 
targets. The process takes 
about 6 minutes and the factors 
remain bound to the target sites 
for about 12 to 14 seconds. 
Sox2 collides and then slides 
along short stretches of DNA. 
Sox2 also binds before Oct4, 
helping it to find its home. 
Cell 156, 1274-1285 (2014) 


More of Greenland 
is starting to melt 


Once-stable glaciers in 
northeast Greenland are now 
shrinking as a result of regional 
warming. 

Shfaqat Khan at the
Technical University of 
Denmark in Kongens Lyngby 
and his colleagues analysed 
satellite and airborne-elevation 
measurements of three 
glaciers that comprise the 
600-kilometre-long northeast 
Greenland ice stream. The 
measurements revealed that, 
after more than a quarter 
of a century of stability, this 
stream began to thin between 
2003 and 2006. The team 
then looked at ocean and 
atmospheric temperature data 
and determined that warmer 
air temperatures, beginning 
in 2003, probably led to the 


melting of sea ice in the region, 
allowing the glaciers to flow 
more freely into the ocean. 
These ice-loss measurements 
exceed projections made for 
this region, and so models 
of global sea-level rise may 
underestimate Greenland’s 
contribution over the coming 
century. 
Nature Clim. Change
http://doi.org/rxb (2014)


Warming climate 
threatens crops 


Climate change could result in 
decreasing yields of staple food 
crops in most parts of the world 
from the 2030s onwards. 
Andrew Challinor at the 
University of Leeds, UK, and 
his team compared the results 
of more than 1,700 simulations 
of climate change impacts 
— with and without various 
adaptation strategies — on 
annual wheat, rice and maize 
(corn) yields. The data suggest 
that, without adaptation, 
average food-crop supplies 
will decline by around 5% per 
degree Celsius of warming. 
Losses of wheat in temperate 
climates and rice in tropical 
environments can be avoided, 
or even reversed, by changing 
factors such as crop varieties, 
planting time, fertilizer use and 
irrigation. But such adaptation 
measures may provide little 
or no benefit for maize yields, 
particularly in tropical regions, 
the authors warn. 
Nature Clim. Change
http://doi.org/rw9 (2014)


Hominin explorers
were poor planners


Hominin migrations, such as
those out of Africa, might have
been led by individuals with low
levels of foresight.

A team led by Colin Wren
at McGill University in
Montreal, Canada, modelled
the migratory behaviour
of individuals, based on
the complexity of their
environments and their
cognitive ability to assess and
seek out better environments.
The model predicted that
those with poorer foresight
were more likely to stumble on
a better habitat, and therefore
disperse into new areas.
Furthermore, homogeneous
environments fostered
exploration because the more
constant level of resources
allowed explorers to travel
further than those in a more
heterogeneous setting.
J. Hum. Evol. http://doi.org/rwd
(2014)


3D printing goes nanoscale

HIGHLY READ on pubs.acs.org in February

Researchers have used a three-dimensional (3D) printing technique to
stack nanometre-scale polymer threads on top of one another, forming
a tiny wall.

A promising 3D nanoscale printing method involves
spinning nanofibres out of a liquid under a strong electrical
field. But only certain shapes have been printed this way,
because the fibres tend to electrically repel each other. To
solve this problem, Minhee Lee and Ho-Young Kim of
Seoul National University fired polymer nanofibres onto
a metal template sitting on top of a grounding plate that
quickly drains the deposited fibres of charge. The fibres then
spontaneously attract each other, and stack to form a
free-standing wall.

The technique cannot yet be used to print objects with
sharp corners but could still be useful for building, for
example, tiny transistors, the authors say.

Langmuir 30, 1210-1214 (2014)


Micro-pump with 
no moving parts 


A pump can propel liquids 
along millimetre-sized 
channels at high speeds with no 
mechanical parts. 

The pump, designed by 
Khashayar Khoshmanesh at 
the Royal Melbourne Institute 
of Technology University in 
Australia and his colleagues, 
is a droplet of metal — an 
alloy of gallium, indium and 
tin — that is held loosely in 
a spherical chamber in the 
middle of a Plexiglas channel. 
A slow chemical reaction in 
the metal droplet results in 


gallate anions diffusing to the 
droplet’s surface. When a low- 
power electric field is applied, 
the rearrangement of charges 
at the droplet’s surface causes 
it to shift in its constrained 
space in such a way that the 
surrounding solution flows 
past it (pictured). 

The pump might be useful 
in microscopic machines, the 
researchers say. 

Proc. Natl Acad. Sci. USA 111, 
3304-3309 (2014) 

SEVEN DAYS


Europe HFC ban 


The European Parliament 

has approved a ban on potent 
greenhouse gases used in 
some cooling systems, aerosol 
sprays and synthetic foams. 
Once in the atmosphere, 

the warming effect of 
hydrofluorocarbon (HFC) 
gases is thousands of times 
that of carbon dioxide, and 
emissions of HFCs across 
Europe have increased by 60% 
since 1990. The legislation 
aims to cut use of the gases by 
79% over the next 15 years. 
European Union member 
states will vote on the new 
rules in April. 


GM crops detected 
Small amounts of genetically 
modified (GM) crops are 
increasingly being detected 

in traded food and feed, says 
the Rome-based United
Nations Food and Agriculture
Organization (FAO). In the first
survey of its kind, published 
on 13 March, the FAO 

reports that, across 75 of its 
member countries, regulators 
identified 198 incidents in 
which low levels of GM crops 
were detected in supposedly 
non-GM crops. Most cases 
occurred in the latter years of 
the survey between 2009 and 
2012, and involved linseed, 
rice, maize (corn) and papaya. 
Once detected, most shipments 
were destroyed or returned 

to the country of origin. The 
FAO attributes the rise to 
increased global production of 
GM crops and better detection 
technology. 


Dangerous research
Animal-rights extremists are
increasingly making their
attacks personal. A 12 March
report by the Federation
of American Societies for
Experimental Biology says
that since 2000, nearly 50%
of attacks by extremists have
targeted individuals rather than
institutions, compared to 9%
over 1990-99. Extremists are
harassing researchers at home
and destroying their property,
the report says. It recommends
that researchers protect
themselves by becoming
familiar with local laws and
keeping their personal contact
information private.


Illegal fires ravage Sumatra
The number of fires raging across the
Indonesian island of Sumatra (pictured) last
week is the highest in recent years. Using
satellite data, the World Resources Institute
(WRI) in Washington DC detected 3,101 fires
on the island. This exceeds the 2,643 fires
detected in the same region in June last year,
when the nation’s government declared a
state of emergency in response to the worst
air pollution ever recorded in southeast Asia.
Roughly half of the current fires are on land
managed by oil-palm and logging companies,
even though using fire to clear land is illegal in
Indonesia, a WRI analysis says.


Undefined illness 
The US Institute of Medicine 
cannot define Gulf War illness, 
which plagues veterans of the 
1990-91 war with symptoms 
of fatigue, pain, memory loss 
and gastrointestinal disorders. 
The diversity of symptoms and 
the lack of a diagnostic test 
prevented a single definition, 
said a report published on 

12 March by the institute, 
based in Washington DC. 

The Department of Veterans 
Affairs had asked for a more 
specific definition to guide 
research and care for veterans. 
The department argues that the 
condition should not be treated 
primarily as a mental illness. 


Swiss grants crisis 


The Swiss government has 
stepped in to offer substitute 
grants in the wake of a 

move that bars researchers 
in the country from 
applying to the European 
Research Council (ERC). 
Switzerland lost its status as 
an associate partner in the 
European Union's €80-billion 
(US$110-billion) Horizon 
2020 funding programme 
after it imposed curbs on 
immigration (see Nature 
506, 277; 2014). The change 
hit hundreds of would-be 
ERC applicants based in the 
country. The Swiss National 
Science Foundation in Bern 
is now offering a number 

of replacement grants of 
comparable size to those 
offered by the ERC. See 
go.nature.com/bfldcw for 
more. 


Climate consensus 
The American Association for 
the Advancement of Science 
issued a report on 18 March 
called What We Know, 
summarizing the science 
behind climate change (see 
go.nature.com/ewy8cd). It 
follows an overview of climate 
science by the UK Royal 
Society and the US National 
Academy of Sciences that was 




launched on 27 February (see 
go.nature.com/puvn4v). The 
documents come ahead of 
the Intergovernmental Panel 
on Climate Change’s report 
focusing on the impacts of 
climate change, which will be 
released at the end of March. 


Inflation evidence 

A telescope at the South Pole 
has revealed strong evidence 
that the Universe went through 
a period of rapid inflation just 
after the Big Bang. To great 
excitement, a collaboration 

led by researchers at the 
Harvard-Smithsonian Center 
for Astrophysics in Cambridge, 
Massachusetts, announced 
that the telescope had spotted 
the influence of gravitational 
waves (ripples in space that 
inflation would have caused) 
on the cosmic microwave 
background, the radiation 
released after the Big Bang. See 
page 281 and 
go.nature.com/lruz8e for more. 


Carp invasion 

Asian carp that have run 
amok in the rivers of the 
midwestern United States 
are spawning farther north 
in the Mississippi River than 
previously recorded, the US 
Geological Survey (USGS) said 
on 11 March. USGS scientists 
found embryos (pictured) of 
bigheaded carp about to hatch 
in the river running through 
Lynxville, Wisconsin. The fish 
were originally imported from 
southeast Asia to the southern 
United States for aquaculture 
and to help clean ponds in 
wastewater-treatment facilities. 
But they escaped and bred, and 
are now causing environmental 
and economic damage. 


TREND WATCH 
International patent applications 
grew 5.1% to more than 200,000 
last year, according to the 
World Intellectual Property 
Organization in Geneva, 
Switzerland. The United States 
exceeded its previous record, 
set before the financial crisis, 
and China saw the largest 
growth with a 16% surge. The 
University of California (46% 
rise to 398 applications) and 
the Massachusetts Institute of 
Technology in Cambridge (49% 
rise to 217) were the top-filing 
educational institutions. 
SOURCE: WIPO 


Research apology 


The Japanese research institute 
that is home to several authors 
of two controversial stem- 

cell papers has apologized 

for errors in the research. 

On 14 March, the RIKEN 
institute in Tokyo announced 
the preliminary findings of an 
investigation into work led by 
Haruko Obokata that describes 
a method for reprogramming 
differentiated mouse cells into 
a pluripotent embryonic-like 
state by stressing them. The 
papers, published in Nature, 
were criticized for using 
several duplicated images and 
because their results have not 
been reproduced. See page 283 
for more. 


CHINA'S PATENTS SURGE 


PEOPLE 


Córdova confirmed 
The US Senate confirmed 
astrophysicist France Córdova 
as head of the National 
Science Foundation (NSF) 
on 12 March. The foundation 
funds basic non-medical 
research with a budget of 
US$7.2 billion; Córdova’s 
confirmation comes just over 
a year after former director 
Subra Suresh resigned mid-term. 
Córdova was most 
recently chairwoman of the 
Smithsonian Institution's 
Board of Regents and a 
member of the National 
Science Board, the panel that 
oversees the NSF. See page 285 
for more. 


Integrity red tape 
The departing director of 

the US Office of Research 
Integrity (ORI), David 
Wright, has accused the 
Department of Health and 
Human Services — in which 
the ORI sits — of “remarkably 
dysfunctional” bureaucracy 
in a resignation letter. 
Wright, who is leaving the 
directorship on 27 March 
after two years in charge, 
described what he said was 
the low priority afforded his 
office by higher management; 
for example, the ORI’s head- 
of-education post has been 
unfilled for 16 months. The 
letter was published online by 
Science on 12 March. Wright 


said that bureaucracy “sucks 
away time and resources that 
we might better use to meet 
our mission”. See page 275 
for more. 


China overtook Germany in 2013 to reach third place in 
international patent filings. 


COMING UP 


25-28 MARCH 
Marking the 100th 
anniversary of the birth 
of Norman Borlaug, 
who saved millions from 
starvation by developing 
higher-yielding crops, 
the International 
Maize and Wheat 
Improvement Center 
in Mexico is holding a 
conference to discuss 
the state of research on 
wheat. 
go.nature.com/hrne9¢g 


26-28 MARCH 
Physicists debate a 
suitable landing site for 
the ExoMars rover at a 
meeting at the European 
Space Agency's 
European Space 
Astronomy Centre near 
Madrid. The mission 
plans to land a rover on 
the red planet in 2018. 
go.nature.com/i5n5r2 


BUSINESS 
BP drilling back on 


Energy giant BP will once 
more be allowed to bid for 

US leases to drill for oil 

and gas offshore, after 

the company reached an 
agreement with the US 
Environmental Protection 
Agency (EPA) on 13 March. 

It had been barred from new 
federal contracts in November 
2012, two years after its 
Deepwater Horizon drilling 
rig in the Gulf of Mexico was 
involved in a massive oil spill. 
The agreement comes with 
conditions — for example, BP 
must improve its practices and 
an EPA-approved auditor will 
review them annually. 


NATURE.COM 
For daily news updates see: 
www.nature.com/news 
NEWS IN FOCUS 


Acid-bath stem-cell study found to contain ‘serious errors’ p.283 
Astrophysicist approved as the head of US National Science Foundation p.285 
Dam release set to flood Colorado River with new life p.286 
Sequencing costs get close to the magic number p.294 


STEFFEN RICHTER/HARVARD UNIVERSITY 

The BICEP2 instrument (foreground) at the South Pole has detected signs of ripples from the Universe’s first moments. 


COSMOLOGY 


Telescope captures view 
of gravitational waves 


Images of the infant Universe reveal evidence for rapid inflation after the Big Bang. 


BY RON COWEN IN CAMBRIDGE, 
MASSACHUSETTS 


Astronomers have peered back to 
the dawn of time and found 
what seems to be the long-sought 
‘smoking gun’ for the theory that the Universe 
underwent a spurt of wrenching, exponential 
growth called inflation during the first tiny 
fraction of a second of its existence. 

Using a radio telescope at the South Pole, 
the US-led team has detected the first 
evidence of primordial gravitational waves, 
ripples in space that inflation generated 
13.8 billion years ago when the Universe first 
started to expand. 

The telescope captured a snapshot of the 
waves as they continued to ripple through 
the Universe some 380,000 years later, when 
stars had not yet formed and matter was still 
scattered across space as a broth of plasma. 
The image was seen in the cosmic microwave 
background (CMB), the glow that radiated 
from that white-hot plasma and that 
over billions of years of 
cosmic expansion has 
cooled to microwave 
energies. 

The fact that inflation, a quantum 
phenomenon, produced gravitational 
waves demonstrates that gravity has a 
quantum nature just like the other known 
fundamental forces of nature, experts say. 
Moreover, it provides a window into inter- 
actions much more energetic than are 
accessible in any laboratory experiment. In 
addition, the way that the team confirmed 
inflation is itself of major significance: it is 
the most direct evidence yet that gravita- 
tional waves — a key but elusive prediction 
of Albert Einstein's general theory of relativ- 
ity — exist. 

“This is a totally new, independent piece of 
cosmological evidence that the inflationary 
picture fits together,” says theoretical physicist 
Alan Guth of the Massachusetts Institute of 
Technology (MIT) in Cambridge, who pro- 
posed the idea of inflation in 1980. He adds that 
the study is “definitely” worthy of a Nobel prize. 


INSTANT INFLATION 

Guth’s idea was that the cosmos expanded at 
an exponential rate for a few tens of trillionths 
of trillionths of trillionths of seconds after the 
Big Bang, ballooning from subatomic to foot- 
ball size. Inflation solves several long-standing 
cosmic conundrums, such as why the observ- 
able Universe appears uniform from one end 
to the other. Although the theory has proved 
to be consistent with all cosmological data 
collected so far, conclusive evidence for it has 
been lacking. 

Cosmologists knew, however, that infla- 
tion would have a distinctive signature: 
the brief but violent period of expansion 
would have generated gravitational waves, 
which compress space in one direction 
while stretching it along another (see ‘Rip- 
ple effect’). Although the primordial waves 
would still be propagating across the Uni- 
verse, they would now be too feeble to detect 
directly. But they would have left a distinctive 
mark in the CMB: they would have polarized 
the radiation in a curly, vortex-like pattern 


known as the B mode (see ‘Cosmic curl’). 


RIPPLE EFFECT 
When a gravitational wave passes through space, it 
compresses it in one direction and stretches it in another, 
both at right angles to the wave's direction (red line). The 
effects of the wave moving along a blue tube are shown. 
SOURCE: MARKUS POSSEL/EINSTEIN-ONLINE.INFO 


COSMIC CURL 
The BICEP2 instrument observed a faint but distinctive twisting pattern, 
or spin, known as a curl or B-mode, in the polarization of the cosmic 
microwave background. This is the first evidence for gravitational waves 
generated by rapid inflation of the Universe some 13.8 billion years ago. 
SOURCE: BICEP2 COLLABORATION 

Last year, another telescope in Antarctica — 
the South Pole Telescope (SPT) — became the 
first observatory to detect a B-mode polarization 
in the CMB (see Nature http://doi.org/rwt; 
2013). That signal, however, was over 
angular scales of less than one degree (about 
twice the apparent size of the Moon in the 
sky), and was attributed to how galaxies in 
the foreground curve the space through which 
the CMB travels (D. Hanson et al. Phys. Rev. 
Lett. 111, 141301; 2013). But the signal from 
primordial gravitational waves is expected to 
peak at angular scales between one and five 
degrees. 

And that is exactly what John Kovac of the 
Harvard-Smithsonian Center for Astrophysics 
(CfA) in Cambridge, Massachusetts, and his 
colleagues now say they have detected, using 
an instrument dubbed BICEP2 that is located 


just metres away from its competitor, the SPT. 

Detecting the tiny B mode required 
measuring the CMB with a precision of one 
ten-millionth of a kelvin and distinguish- 
ing the primordial effect from other possible 
sources, such as galactic dust. 

“The key question,” says Daniel Eisenstein, 
an astrophysicist at the CfA, “is whether there 
could be a foreground that masquerades like 
this signal”. But the team has all but ruled out 
that possibility, he says. First, the researchers 
were careful to point BICEP2 — an array of 
512 superconducting microwave detectors — 
at the Southern Hole, a patch of sky that is 
known to contain only tiny amounts of such 
emissions. They also compared their data 
with those taken by an earlier experiment, 
BICEP1, and showed that a dust-generated 
signal would have had a different colour and 
spectrum. 

Furthermore, data taken with a newer, more 
sensitive polarization experiment, the Keck 
array, which the team finished installing at the 
South Pole in 2012 and will continue operating 
for two more years, showed the same charac- 
teristics. “To see this same signal emerge from 
two other, different telescopes was for us very 
convincing,” says Kovac. 

“The details have to be worked out, but 
from what I know it’s highly likely this is what 
we've all been waiting for,” says astronomer 
John Carlstrom of the University of Chicago, 
Illinois, who is the lead researcher on the SPT. 
“This is the discovery of inflationary gravita- 
tional waves.” 


SOLID SIGNATURE 

Cosmologist Marc Kamionkowski adds: 
“To me, this looks really, really solid.” He 
was one of the first cosmologists to calculate 
what the signature of primordial 
gravitational waves should look like in 
the CMB. The findings are “on a par with 
dark energy, or the discovery of the CMB 
— something that happens once every 
several decades”, says Kamionkowski, 
who is at Johns Hopkins University in 
Baltimore, Maryland. 

The strength of the signal measured 
by BICEP2, although entirely consist- 
ent with inflation, initially surprised the 
researchers because it is nearly twice as 
large as estimated from previous experi- 
ments. According to theory, the intensity 
of a B-mode signal reveals how fast the 
Universe expanded during inflation, and 
therefore suggests the energy scale of the 
cosmos during that epoch. The data pinpoint 
the time when inflation occurred 
(about 10⁻³⁷ seconds into the Universe's 
life) and its temperature at the time, 
corresponding to energies of about 
10¹⁶ gigaelectronvolts, says cosmologist 
Michael Turner of the University of Chi- 
cago. That is the same energy at which 
three of the four fundamental forces of 
nature — the weak, strong and electro- 
magnetic force — are expected to become 
indistinguishable from one another in a 
model known as the grand unified theory. 

Because inflation took place in the realm 
of quantum physics, seeing gravitational 
waves arise from that epoch provides “the 
first-ever experimental evidence for quantum 
gravity”, says MIT cosmologist Max 
Tegmark — in other words, it shows that 
gravity is at heart a quantum phenome- 
non, just like the other three fundamental 
forces. Physicists, however, have yet to fully 
understand how to reconcile general rela- 
tivity with quantum physics from a theory 
standpoint. 

The researchers reported the findings 
on 17 March at a press briefing at the CfA, 
held just after they described their results 
to scientists in a technical talk. The team 
also released several papers describing the 
results. In so doing, it seems to have beaten 
the SPT and also several other groups 
racing to find the fingerprint of inflation 
using an assortment of balloon-borne and 
ground-based experiments and one satel- 
lite, the European Space Agency’s Planck 
spacecraft. 

More-extensive maps of the B-mode 
polarization, and especially a full-sky 
survey, which the Planck telescope may be 
able to obtain later this year, should provide 
more clues about how inflation unfolded 
and what drove it. In addition to looking 
farther back in time than ever before, the 
discovery “is opening a window a trillion 
times higher in energy than we can access 
with the Large Hadron Collider”, the world’s 
premier atom smasher, notes cosmologist 
Avi Loeb of the CfA, who is not part of the 
BICEP2 team. 


CELL BIOLOGY 




Stem-cell method 
faces fresh questions 


Papers describing acid-bath technique under more scrutiny 
after institute’s investigation finds errors in methodology. 


BY DAVID CYRANOSKI 


The veracity of two papers that detailed a 
method to reprogram mature cells into 
an embryonic state by exposing them 
to stress has come under more pressure. Days 
after the first author’s institute reported “serious 
errors” in the papers’ methodology, questions 
were raised over the same researcher's doctoral 
dissertation and the cells used in the study. 

On 14 March, RIKEN, Japan’s largest 
research organization, which runs the Center 
for Developmental Biology (CDB) in Kobe 
where first author Haruko Obokata and several 
of her co-authors work, announced the interim 
findings of its investigation into allegations 
of irregularities in the method. One RIKEN 
investigator advised them to retract the papers. 
Then, in further developments, Obokata called 
into question the quality of her own doctoral 
thesis, which is already under investigation by 
the university that granted it. And a co-author 
on the reprogramming papers says that he 
is sending some of the cells produced in the 
experiments for independent verification. 

The two papers made headlines around the 
world when they were published in Nature 
on 30 January (H. Obokata et al. Nature 505, 
641-647 and 676-680; 2014). The technique, 
which the team called stimulus-triggered acqui- 
sition of pluripotency (STAP), is important 
because cells reprogrammed into an embryonic 
state are ideal for studying the development of 
disease or the effectiveness of drugs. 

But within weeks, the papers were attacked 
by scientists over their use of several duplicated 
images and by those who could not reproduce 
the work, prompting RIKEN to investigate. 

At the press conference, a five-person panel, 
including RIKEN director and Nobel-prize- 
winner Ryoji Noyori, noted six problems. Two 
were dismissed as unintentional mistakes. Four 
others (including an image of an electrophoresis 
gel that seemed to have had a lane added in 
later, and plagiarism in part of the methods 
section) were deemed more serious and are still 
under investigation. The panel offered no clear 
answers about whether the STAP phenomenon 
is real, but noted that RIKEN co-author Hitoshi 
Niwa was attempting to replicate the method. It 
added that there were no signs of fraud. 

Panel member Masatoshi Takeichi, director 


of the CDB, also told reporters that the three 
co-authors from the institute — Obokata, Niwa 
and Yoshiki Sasai — had agreed to retract the 
paper at his request. But a statement in Japanese 
by the trio, delivered at the press conference, 
said only that they were “considering a retrac- 
tion and contacting outside authors to discuss 
that possibility”. Charles Vacanti, of Harvard 
Medical School in Boston, Massachusetts, who 
is the senior corresponding author on the first 
of the two papers (H. Obokata et al. Nature 505, 
641-647; 2014), has made it clear that he has no 
intention of retracting unless there is compel- 
ling evidence that the data are incorrect. 

To add confusion, the first 20 pages of Obokata’s 
thesis, completed in 2011 at Waseda University 
in Tokyo, were found by Nature News to be 
taken from a US National Institutes of Health 
primer on stem cells, and one image in the 
results section has been reproduced 
from a commercial website without a 
citation. Moreover, 
Vacanti, who was listed on the thesis as a member 
of the examination committee that approved 
it, told Nature News: “I was not presented with 
or asked to read a copy of her dissertation.” 

Last week, Obokata wrote to an unnamed 
professor at Waseda University indicating that 
she wanted to retract the thesis. She has not, 
however, formally requested a retraction. 

A desire to resolve the STAP controversy has 
led to an investigation into the identity of the 
cells in the papers. Teruhiko Wakayama of the 
University of Yamanashi, a senior author on the 
second paper (H. Obokata et al. Nature 505, 
676-680; 2014), helped to test the pluripotency 
of Obokata’s STAP stem cells by injecting them 
into mouse embryos. By turning into different 
cell types within these mice, the cells proved 
they had the developmental capacity that STAP 
promises. But he has now sent the cells that 
Obokata gave him to an independent institute 
for genetic analysis to see if they are really STAP 
cells. “What did I inject into those blastocysts?” 
Wakayama asks. “This is what I want to know 
more than anything else.” He hopes to have his 
answer in the next few months. 




Nature News is editorially independent of the research 
publications section of Nature. 
NSB 


Incoming NSF director faces 
challenges in Congress 


Former Purdue University president France Córdova inherits an agency at a crossroads. 


BY JESSICA MORRISON 


By all accounts, astrophysicist France 
Córdova thrives in the face of a challenge. 
As NASA’s chief scientist in the 
mid-1990s, she helped then-administrator 
Daniel Goldin to push his ‘faster, better, cheaper’ 
vision for space research. Later, as chancellor of 
the University of California, Riverside, she won 
approval for a long-sought medical school. And 
during her 2007-12 tenure as president of Pur- 
due University, she helped to double research 
funding despite a global recession. 

That persistence should serve her well as 
director of the US National Science Founda- 
tion (NSF), the position she assumed after 
her appointment was confirmed by the Sen- 
ate on 12 March. The US$7.2-billion agency, 
which funds basic non-medical research, has 
seen its budget stagnate in recent years and is 
now facing attacks on its peer-review system 
and social-science division from conservative 
members of Congress. 

“The toughest challenge that she’s fac- 
ing right now is one that is faced by all of 
the research institutions that are funded by 
taxpayer dollars — relevance,” says Internet 
pioneer Vinton Cerf, vice-president of Google 
and a member of the National Science Board, 
which oversees the NSF. 


POLITICAL EXPERIENCE 
Córdova studied English at Stanford University 
in California, and received a physics 
doctorate from the California Institute of 
Technology in Pasadena in 1979. She then 
spent a decade at Los Alamos National Labo- 
ratory in New Mexico, studying X-ray and 
γ-ray sources, before making her first foray 
into administration at Pennsylvania State Uni- 
versity in University Park, where she served as 
head of the astronomy department from 1989 
to 1993. Her subsequent moves to NASA, the 
University of California, Riverside, and Pur- 
due in West Lafayette, Indiana, saw her take on 
positions of increasing responsibility. 
Córdova is no stranger to the NSF or its 
struggles. Until her confirmation, she was a 
member of the National Science Board, and 
chaired the governing board of the Smithso- 
nian Institution in Washington DC, the world’s 
largest museum complex. Her time on the sci- 
ence board is likely to shorten her learning 


curve as she assumes the NSF’s top job, says 
environmental microbiologist Rita Colwell, 
who led the agency from 1998 to 2004. 


France Córdova became head of the National 
Science Foundation on 12 March. 

Goldin, now chief executive of the Intelli- 
sis Corporation in San Diego, California, says 
that Córdova has already proved her ability to 
operate in the thorny political environment of 
Washington DC. At NASA — her first govern- 
ment post — she was pivotal to implement- 
ing Goldin’s controversial plan to expand the 
agency’s scientific reach and stretch its budget 
by backing smaller, less expensive science and 
exploration missions. “She developed really 
good, solid relationships with staff in the 
House and Senate, and the White House,” he 
says. “She is able to communicate with people 
on the issues and not emotions.” 

That experience, and her subsequent work 
as a university administrator, is not far from 
Córdova’s mind as she assumes her new role. 
“I don't feel like I’ve ever left government,” she 
told reporters the day after her Senate confirmation. 
“I’ve been involved in one way or 
another: taking money or giving it.” 

With austerity a priority in Washington DC, 
Córdova is already thinking about how to make 
the NSF’s research dollars go further. “Budg- 
ets are limited, and that’s always a challenge,” 


she tells Nature. “But I think that government 
organizations like NSF could work toward 
more partnerships with industry, foundations 
and private donors, to leverage the amount of 
money that each has.” 


CONGRESSIONAL CAMPAIGN 

Córdova has a long-term goal to further the 
foundation’s emphasis on interdisciplinary 
and international research programmes — a 
focus of her immediate predecessor as NSF 
director, engineer Subra Suresh, who resigned 
a little over a year ago. But she also faces a more 
immediate challenge: winning over some of 
her agency’s staunchest critics, members of 
Congress who are questioning the quality of 
the research that the NSF supports. 

On 10 March, Republican members of the 
House of Representatives’ science committee 
introduced a bill that would require the NSF 
to certify each grant it awards as serving the 
“national interest” in one of six categories (see 
Nature http://doi.org/rxg; 2013). The bill, put 
forward by congressmen Lamar Smith (Repub- 
lican, Texas) and Larry Bucshon (Republican, 
Indiana), also seeks to slash funding for the 
NSF’s social-science directorate by 22%, to 
$200 million, while increasing spending on 
many of the agency’s other major research 
divisions. That would commit more of the 
NSF’s limited funds to producing discoveries 

that drive economic growth, says Smith. 

The debate over the bill in the House 
and the science community echoes a simple 
sentiment that Córdova heard over and 
over again in recent weeks as she spent time 
getting to know key members of Congress and 
their aides while waiting for the Senate to confirm 
her new post. “We have to better explain 
why we do what we do,” she says of the NSF. 

Córdova's operating style may be just what 
is needed to get the message across. Former 
colleagues say that she draws on her diplomacy 
and innate curiosity to build a broad base of 
support for her goals. “She surrounds herself 
with clear thinkers, and she solicits their opin- 
ions,” says Philip Low, a chemist who directs 
Purdue’s Center for Drug Discovery. “No 
one is the sole repository of wisdom, and she 
recognizes that.” 
Water diversions in two countries mean that the mighty Colorado River peters out before it reaches Mexico’s Gulf of California. 


Water returns to arid 
Colorado River delta 


US-Mexico agreement paves the way for a rare 
environmental test. 


BY ALEXANDRA WITZE 


One of North America’s most iconic 
rivers is about to undergo an unprecedented 
experiment in ecological 
engineering. 

On 23 March, operators at the Morelos 
Dam along the US-Mexico border near Yuma, 
Arizona, will open the gates and begin releas- 
ing water downstream. The goal is to dampen 
broad swathes of the arid Colorado River delta 
for the first time in decades, allowing new 
cottonwood and willow trees to germinate and 
restore small patches of riparian habitat. 

The move, which follows bitter international 


battles over water rights, will mark the first 
time that the United States and Mexico have 
put water back into the parched riverbed for 
environmental purposes. It is both a practi- 
cal and a symbolic victory for conservation- 
ists who have fought to restore what was once 
800,000 hectares of lush wetlands, as well as a 
rare opportunity for ecologists worldwide to 
watch what happens. 

The mighty Colorado rises on the western 
slope of the Rocky Mountains and drains 
seven US and two Mexican states along its 
2,300-kilometre course (see ‘River run’). 
Before the 1930s, when dams began to throttle 
the river, its water ran unfettered into the Gulf 
of California. But most was soon diverted to 
slake the thirst and agricultural fields of mil- 
lions of people in the American Southwest. 
A 1944 international treaty granted Mexico 
just 10% of the river's original flow; vast delta 
wetlands shrivelled to patches of vegetation 
clinging to sandy plains. 

In 2012, officials drew up an adden- 
dum to the original water treaty. Known as 
Minute 319, the agreement lays out how the 
United States and Mexico will share water 
surpluses and shortages until the end of 2017. 
But it also mandates the experimental release 
of what it calls “water for the environment”. 

That was the call to action for a group of 
researchers from universities, government 
agencies and non-governmental organizations 
who had been working for years on delta con- 
servation (E. P. Glenn, K. W. Flessa and J. Pitt 
Ecol. Eng. 59, 1-6; 2013). They calculated how 
much water should be released, and over what 
period of time. As a result of their recommendations, 
over nearly eight weeks, dam operators 
will allow some 130 million cubic metres 
of water to travel downstream. 

That is less than 1% of the river’s average 
annual flow, but it is still significant, says 
Francisco Zamora Arroyo, director of the 
Colorado River Delta Legacy Program at the 
Sonoran Institute in Tucson, Arizona. “It’s 
going to allow us to learn from this event and 
better determine the water needs for the delta 
in the future,” he says. 


The river supplies water 
to roughly 40 million 
people in seven US states 
and two states in Mexico. 

Most of the flow will come in a three-day 
pulse, from 27 to 29 March, which is meant to 
mimic a small-scale spring flood (K. W. Flessa 
et al. Eos 94, 485-486; 2013). “It’s structured 
so that we hopefully get a big pulse crossing 
over into places that have a lot of restoration 
potential,” says team co-leader Karl Flessa, a 
geoscientist at the University of Arizona in 
Tucson. “And we hope that the groundwater 
doesn’t recede too quickly after that.” 

In the days and weeks after the water pulse, 
monitoring teams will descend on the delta, 
because no one is entirely sure how long the 
water will last or what its exact course will 
be. Hydrologists will gather data on changes 
in surface water and groundwater; ecologists 
will count saplings and determine whether that 
new habitat is attracting birds. 

The experiment could benefit the almost 
400 bird species that live in the delta, says Osvel 
Hinojosa Huerta, an ecologist with Pronatura 
Noroeste, a conservation group with offices 
in San Luis Rio Colorado, Mexico. Every year, 
about 400,000 migratory waterbirds also pass 
through the region, an important stop on their 
flyway along the Americas. 

In recent decades, invasive saltcedar trees 
have crowded out much of the native habitat. 
The water pulse is designed to create broad 
expanses of moist sand in which native cotton- 
woods and willows can germinate, says Patrick 
Shafroth, a plant ecologist with the US Geo- 
logical Survey in Fort Collins, Colorado. 

The challenge will be to keep those newborn 
trees alive. Shafroth helped to lead related 
experiments on a Colorado River tributary, 
the Bill Williams River in Arizona, in 2005 and 
2006. He says that timing is critical for getting 
seeds distributed to sites where they can take
root and grow quickly enough to survive. 
Thanks in part to such lessons, the delta pulse 
flow is designed to try to keep the water table 
from dropping by more than 2.5 centimetres 
per day, the rate at which cottonwood roots 
can keep up. 

In addition to the eight-week pulse flow, 
dam operators will also release a ‘base flow’ of
65 million cubic metres, on a schedule that is 
yet to be determined. The idea is to try to nour- 
ish and maintain any new growth that appears 
this spring. 

Team leaders acknowledge that plenty of 
things could go wrong. Water might vanish 
into unexpected places, seeping deep underground
where it does little good. Trees might
not germinate if their seeds do not spread
properly. And many saplings could perish
if the summer is particularly brutal. But a
successful experiment could build support for 
future releases to the delta, although only one 
is provided for by the Minute 319 agreement. 

Whatever happens during the experiment 
will almost certainly come as a surprise, says 
John Schmidt, an ecologist at Utah State Uni- 
versity in Logan, who also heads the US Geo- 
logical Survey’s Grand Canyon Monitoring 
and Research Center in Flagstaff, Arizona. In 
1996, Schmidt helped to lead some larger engi- 
neered floods much higher up the Colorado. 
The goal was to restore sandbars and beaches 
in areas such as Grand Canyon National Park, 
but early attempts failed because the new- 
born beaches quickly eroded (see Nature 420, 
356-358; 2002). 

“We don’t really know what’s going to 
happen,” says Flessa. “If we did, we wouldn't be
doing the experiment.”
Thomas Insel wants studies to identify the biological mechanisms that underlie psychiatric symptoms.


NIH rethinks 
psychiatry trials 


Mental-health division will no longer fund research aiming 
to relieve symptoms without probing underlying causes. 


BY SARA REARDON 


Thomas Insel, the director of the US
National Institute of Mental Health
(NIMH), has had enough of shooting in
the dark. He thinks that if a clinical trial of a psychiatric
therapy fails, scientists should at least
learn something about the brain along the way.

Now Insel is translating that belief into action:
the NIMH, based in Bethesda, Maryland, has 
decided to stop funding clinical trials that aim 
merely to ease patients’ symptoms. “Future trials 
will follow an experimental medicine approach 
in which interventions serve not only as poten- 
tial treatments, but as probes to generate infor- 
mation about the mechanisms underlying a 
disorder”, he wrote in a 27 February blog post
announcing the move. This funding switch, 
which will affect grants due to be made in a few
months’ time, intensifies the NIMH’s apparent 
shift in emphasis from abstract psychiatry to the 
neurobiological roots of disease. 

“It’s a totally new departure for us,” says 
Bruce Cuthbert, a clinical psychologist and 
director of the institute’s adult translational- 
research division. Insel notes that the NIMH 
spent about US$100 million on clinical trials in
2013, and says that more than half of recipient
projects received funding without any requirement
to examine the biological processes
involved in a disease. In many cases, “if you get
a negative result you have no idea why, and you
have to try something else at random”, Cuthbert
says. “It’s an incredible waste of money.”

The new rules, which will apply to the grant
cycle that begins in June, also seek to increase 
transparency by requiring faster online reg- 
istration of trials and stricter guidelines for 
reporting results. Insel acknowledges that 
researchers may have to rework their studies 
to satisfy the new guidelines. “I think this will 
be really unpopular,” he says.

History shows that Insel is not daunted by 
controversy. In April 2013, he raised eyebrows 
when he announced that the NIMH would 
stop using the Diagnostic and Statistical Man- 
ual of Mental Disorders 5 (DSM-5), the latest 
version of the American Psychiatric Associa- 
tion’s diagnostic guide, to classify mental dis- 
orders (see Nature http://doi.org/rvd; 2013). 
The book’s definitions tend to lump patients 
together by symptoms, which often do not pre- 
cisely map to what is wrong with their brains, 
he says. With this haphazard approach to trials, 
even if symptoms are alleviated, researchers
still may not understand what caused them. 
“We've studied drugs, not disorders — if you 
throw something at the wall, and P is less than 
0.05, you win,” he says. Such thinking, he adds, 
wastes time and money. 

Michael First, a psychiatrist at Columbia 
University in New York, says that targeting the 
wrong patient groups can also make therapies 
seem less effective than they really are. A treat- 
ment that helps a tiny fraction of people with 
a DSM diagnosis of depression, for example, 
might be highly effective for all people with a 
more specific trait, such as the inability to enjoy 
life. Beginning a trial with an idea about a treatment’s
mechanism may allow researchers to
quickly determine whether to expand or end 
the testing, First says. That could help to tempt 
drug companies to develop new classes of drugs 
for mental illnesses — an endeavour that many 
have abandoned after decades of failed research. 

But Harold Pincus, a psychiatrist also at 
Columbia University, cautions that the data- 
driven approach is not a sure thing. Focusing 
trials on more specific biological targets will 
require researchers to think very carefully 
about how they recruit and classify their 
research subjects. Neuroscientist Michael 
Owen of Cardiff University, UK, concurs. 
“It's a pretty scary prospect for everyone if you 
throw away the diagnostic rule book,” he says.

Others are strongly opposed to the NIMH’s 
new approach. Psychiatrist Allen Frances, an 
emeritus professor at Duke University in Dur- 
ham, North Carolina, worries that an emphasis 
on brain science comes at patients’ expense. 
“Don’t bet everything you have on something
so difficult,” he says. “We have a shameful 
problem of neglect of existing patients and an 
empty promissory note to the future that sci- 
ence will solve everything.” Understanding the 
workings of the brain is, of course, a fine goal, 
he says, but “it will be decades and decades 
before anyone will be helped by this research”. 

Insel maintains that the NIMH wants the 
trials it funds to continue to ameliorate patients’ 
symptoms, but to probe how their brains work 
at the same time. His emphasis on biological 
mechanisms is not unique at the US National 
Institutes of Health, the NIMH’s parent agency: 
the National Institute of Neurological Disor- 
ders and Stroke in Bethesda is increasingly 
aiming therapies at specific targets in the brain 
rather than abstract constellations of symp- 
toms. “We've all been burnt badly by treating 
the brain as a black box,” says that agency’s 
deputy director, neurologist Walter Koroshetz. 

The difference, Koroshetz adds, is that a 
great deal is already known about how neu- 
rological disorders such as Parkinson's disease 
work, and that allows scientists to focus on 
other challenges, such as adapting drugs that 
work in animals to work in humans. Psychi- 
atry has a long way to go before it will have 
such accurate targets. “I feel bad for them,” 
Koroshetz says.

SEE EDITORIAL P.273
Deep brain stimulation has shown promise in treating conditions such as Parkinson’s
disease. Now scientists are using the technology to eavesdrop on problem neural circuits. 


BY HELEN SHEN 


For Frank Donobedian, sitting still is a challenge. But
on this day in early January, he has been asked to do 
just that for three minutes. Perched on a chair in a 
laboratory at Stanford University in California, he 
presses his hands to his sides, plants his feet on the 
floor and tries with limited success to lock down the 
trembling in his limbs — a symptom of his Parkinson's disease. Only 
after the full 180 seconds does he relax. 

Other requests follow: stand still, lie still on the floor, walk across the 
room. Each poses a similar struggle, and all are watched closely by Helen 
Bronte-Stewart, the neuroscientist who runs the lab. 

“You’re making history,” she reassures her patient.

“Everybody keeps saying that,” replies the 73-year-old Donobedian, a 
retired schoolteacher, with a laugh. “But I'm not doing anything.”

“Well, your brain is,” says Bronte-Stewart. 

Like thousands of people with Parkinson's before him, Donobedian 
is being treated with deep brain stimulation (DBS), in which an implant
quiets his tremors by sending pulses of electricity into motor areas of his 
brain. Last October, a team of surgeons at Stanford threaded the device's 
two thin wires, each with four electrode contacts, through his cortex into 
a deep-seated brain region known as the subthalamic nucleus (STN). 
But Donobedian’s particular device is something new. Released to 
researchers in August 2013 by Medtronic, a health-technology firm in 
Minneapolis, Minnesota, it is among the first of an advanced generation 
of neurostimulators that not only send electricity into the brain, but can 
also read out neural signals generated by it. On this day, Bronte-Stewart 
and her team have temporarily turned off the stimulating current and 
are using some of the device’s eight electrical contacts to record abnor- 
mal neural patterns that might correlate with the tremors, slowness of 
movement and freezing that are hallmarks of Parkinson's disease. 
Until now, such data have been accessible only when a patient’s brain 
is exposed briefly during surgery. But being able to make long-term 
neural recordings from human patients may become increasingly
important — especially because researchers are experimenting with 
using DBS as a treatment for many other neurological conditions, 
including depression, obsessive-compulsive disorder and Tourette's 
syndrome. The networks involved in such disorders are even less well 
understood than those involved in Parkinson's disease, says Helen 
Mayberg, a neurologist at Emory University in Atlanta, Georgia. Devices 
such as Donobedian’s could change that, allowing scientists to start to 
understand just how unhealthy neural networks misfire in different 
diseases, and what DBS actually does to the brain. “Every disease will be 
different and one size won’t fit all,” Mayberg says. “The new technology
is going to enable progress exponentially.” 

Eventually, adds Bronte-Stewart, engineers could use the new-found 
knowledge about brain networks to build 
even more-advanced brain implants — 
devices that could interpret the neural signals 
they record, monitor their own effectiveness 
and generate personalized treatments. 

“This is such an exciting time,” she says. 
“This is the first time we're really getting a 
window into the brain.”


‘BLACK BOX’ BEGINNINGS 

The roots of DBS reach back to the 1960s, 
when Parkinson's disease was commonly 
treated with surgery to remove or destroy 
certain brain regions. To pinpoint which areas to target in each patient, 
some neurosurgeons began to experiment with electrical stimulation. 
They discovered that the delivery of rapid pulses to the basal ganglia 
— a cluster of structures including the STN — could markedly reduce 
the patient’s tremors. By the late 1980s, long-term brain stimulation 
started to emerge as an alternative treatment to surgery. DBS has since
been approved for the treatment of Parkinson’s and other movement 
disorders by both the US Food and Drug Administration (FDA) and 
European regulators, and has been used in more than 100,000 people. 

The biological mechanism underlying DBS remains mysterious, and 
is a subject of controversy. “We've been guessing a lot over the last decade
or two,” says Michael Okun, a neuroscientist at the University of
Florida in Gainesville. “It would be premature for anyone to claim they 
know exactly how the therapy works.” 

There are some clues, however. For example, DBS is not thought to 
mimic any natural signals in the brain. The high-frequency pulses — 
delivered at 130-180 times per second for Parkinson's disease — exceed 
the 1-100-hertz frequency range of most natural neural communica- 
tions. Furthermore, with each 60-90-microsecond burst, DBS typically 
delivers several orders of magnitude more current than any neuron or 
groups of neurons can produce. 
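
Those figures imply that the stimulator is actually delivering current for only a small fraction of each second. The sketch below works through that arithmetic, assuming each cycle contains a single pulse of the stated width; the function name is illustrative and not part of any device software.

```python
def duty_cycle(pulse_rate_hz: float, pulse_width_us: float) -> float:
    """Fraction of each second during which current is being delivered.

    Assumes one pulse per cycle; the pulse width is given in microseconds.
    """
    return pulse_rate_hz * pulse_width_us * 1e-6


# Ranges quoted in the article for Parkinson's disease.
lowest = duty_cycle(130, 60)    # slowest rate, shortest pulse
highest = duty_cycle(180, 90)   # fastest rate, longest pulse
print(f"Current is on for {lowest:.2%} to {highest:.2%} of the time")
```

On these assumptions the current is switched on for roughly 0.8% to 1.6% of the time.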

And it does not seem to produce permanent changes in the brain, at 
least not when applied to Parkinson's disease, currently one of the most 
common targets of the technology. Turning on the current can produce 
immediate relief from symptoms such as tremor and rigidity. But in many 
people, symptoms return seconds or minutes after the device is turned 
off, or the battery runs out — which happens every 3-5 years. Nor does 
the therapy halt the progressive neurodegeneration associated with the 
disease; in the long run, patients will typically succumb to symptoms that 
are not well treated by DBS, such as cognitive deterioration. 

From the evidence gleaned so far, researchers suspect that DBS does 
more than affect neural tissue at the site of the electrodes: it somehow 
disrupts pathological signals that reverberate through multiple brain 
regions, corrupting their communications (see ‘Circuit training’). 

That theory meshes with the emerging view that Parkinson’s disease,
as well as depression and many other neuropsychiatric conditions, are
best understood as network dysfunctions. “That’s a really important 
realization that has caught on in the last five years,” says Cameron 
McIntyre, a biomedical engineer at Case Western Reserve University in 
Cleveland, Ohio. Indeed, it has helped to launch two major neuroscience 
efforts in the past year: the US Brain Research through Advancing Innovative
Neurotechnologies (BRAIN) Initiative, and the European Union's
Human Brain Project. 

The primary target of DBS for Parkinson's disease, for example — 
the STN — sits in the middle of a highly interconnected brain network 
that helps an individual to control his or her motions. There is some 
evidence that as Parkinson's destroys neurons in the basal ganglia, the
activity of groups of cells in the STN and across this sensorimotor net- 
work becomes abnormally synchronized, locking at certain frequencies. 
DBS seems to release them from these activity patterns, as do some of 
the drugs that relieve Parkinson's symptoms.

Recordings from the new generation of neurostimulators are poised 
to elucidate these mechanisms, not just for Parkinson’s but also — as 
DBS applications broaden — for psychiatric 
conditions. The data could help to resolve 
concerns about the wisdom of expanding 
the treatment’s usage. Although the sensori- 
motor network involved in Parkinson's 
disease has been mapped in great detail, 
says Joseph Fins, a medical ethicist at Weill 
Cornell Medical College in New York City, 
much less guidance is available on how best 
to apply the technology to other disorders. 
“There has got to be a biological rationale 
for what you're intending to do,” he says. 

But others argue that controlled testing 
of DBS in humans need not wait for complete or near-complete under- 
standing of the relevant networks. “As a clinician, that’s not really the 
important question,” says Benjamin Greenberg, a psychiatrist at Brown
University in Providence, Rhode Island. “The real questions are: do 
these treatments help people? Are they safe?” 

Okun adds that, unlike the field of movement disorders, the mecha- 
nistic study of neuropsychiatric disorders has been slowed by a lack of 
realistic animal models. “If we're going to move forward with some of 
these human diseases, we are going to have to use humans — in a very
careful way, of course,” he says. 


ZOOMING IN 

Mayberg has been doing just that for more than a decade. In 2005 she 
published one of the first studies on the use of DBS to alleviate severe, 
treatment-resistant depression. Since then, she has mainly focused her
experiments on a structure known as the subgenual cingulate, in which
elevated metabolism has been shown to correlate with the severity of a
patient’s depression. She estimates that the use of DBS in this region and
elsewhere has successfully eased symptoms in 40-60% of the roughly 
150 cases of depression reported on so far. But in recent years, her group 
has begun to do better by using brain imaging to map the dense web of 
nerve fibres zigzagging through and around the subgenual cingulate, 
which connects to regions involved in learning, motivation, appetite 
and sleep. Combining this information with the effects seen in patients, 
Mayberg is zeroing in on millimetre-scale differences in electrode placement
that can make the difference between success and failure.

Potentially, she says, new implants such as the device being tested by 
Bronte-Stewart could help her team to do even better, allowing researchers 
to monitor patients’ condition in real time and fine-tune the stimulation 
pulses to maximize benefit. “There may be an optimal tuning frequency 
for a given person, and it may not be the same for everyone,” she says.

Creating personalized DBS treatments is a top priority in this field. 
Just before Donobedian’s meeting with Bronte-Stewart, his neurologist, 
Camilla Kilbane of Stanford University, spends half an hour tuning the 
device's stimulation settings to address his symptoms. 

Using a short-range radio device, she programs a pulse generator 
implanted in Donobedian’s upper chest. The generator — about half the 
size of a deck of cards — sends electrical pulses through insulated wires 
that run under the skin of his neck and scalp, and into his brain. Kilbane 
has already determined during a previous visit the subset of electrode 
contacts she wants to tweak, and Donobedian has stopped taking his
supplementary Parkinson's drugs overnight so that Kilbane can cleanly 
isolate the effects of neurostimulation. 

As she drops the voltage and the implant can no longer overcome 
Donobedian’s tremors, his hands and feet begin to quiver again. Within 
seconds, the tremors grow and spread, until his arms clap against his 
sides and his shoes tap the linoleum floor. Kilbane clicks the voltage up 
again, and Donobedian’s limbs calm down — but then his arms begin to 
tingle, a common side effect of DBS. At intermediate voltages, his right 
leg stops shaking, but the other continues to tremble. 

“It’s stubborn, that left foot!” remarks Kilbane. She spends another 
10 minutes inching the voltage up and down, gradually homing in on 
an optimal setting. Even after this, Donobedian may need to return in 
the coming months for further fine-tuning. 

“What we have right now for DBS works, but it’s very much the first 
generation,” says Bronte-Stewart. She and others are using the new
recording-capable DBS implants as a stepping stone towards ‘closed- 
loop’ neurostimulators — devices that can continuously track an indi- 
vidual’s brain activity and automatically optimize settings as needed 
in real time. As a first step, the Stanford group is beginning to mine
the electrical recordings downloaded wirelessly from the implants in 
Donobedian and other patients to find patterns that correlate with dif- 
ferent Parkinsonian symptoms. They are also looking to see how these 
patterns might change in the context of different actions, such as sitting, 
standing and walking — data that could not be obtained with bulky hos- 
pital machines. Indeed, Bronte-Stewart says, there may not be just one 
set of ‘optimal’ stimulation parameters. “We may find out there are dif- 
ferent frequency ranges that are better for different functions,” she says. 


SMARTER STIMULATION 

As scientists collect more data, some manufacturers are already starting 
to make strides in closed-loop technology. Last November, the FDA 
approved the first closed-loop, implantable neurostimulator for intrac- 
table epilepsy, another disorder attributable to network dysfunction. 
The device, made by NeuroPace in Mountain View, California, moni- 
tors neural networks for the first sign of abnormal activity — which 
in some patients originates again and again at one or a few ‘epileptic 
foci’ — then responds with a pulse of electrical current to prevent a 
seizure. “We use stimulation to disrupt that abnormal activity so that it 
doesn’t get picked up by the adjacent neurons,” explains Frank Fischer,
the company’s chief executive. 

But Fischer concedes that, whatever the device might do for epilepsy 
treatment, the technology is not immediately applicable to other condi- 
tions. Epilepsy is a comparatively simple disorder, generally consisting 
of discrete episodes of abnormal brain activity. By contrast, Parkinson’s 
disease involves a mishmash of symptoms that rise, fall and morph over 
time. Researchers are still searching for the relevant neural signatures in 
Parkinson’s and other diseases, and developing the computational tools 
required to keep up with changing symptoms. 

The first laboratory demonstration of a closed-loop DBS system for 
Parkinson's disease was reported last year by experimental neurolo- 
gist Peter Brown at the University of Oxford, UK, for a group of eight 
patients. Brown plugged the patients’ DBS implants into an external
machine, which triggered stimulation of the STN only when certain 
abnormal brain rhythms were detected. This selective stimulation 
improved the symptoms by almost 30% compared with standard DBS 
treatments, which stimulate the brain at regular intervals. 
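
The triggering idea can be illustrated with a toy example. The sketch below is not the Oxford group's method, which the article does not describe in detail: it simply splits a recording into short windows and switches stimulation on whenever the signal strength in a window crosses a threshold. The sample rate, the 20-hertz rhythm, the window length and the threshold are all assumptions chosen for illustration.

```python
import math
from typing import List, Sequence

SAMPLE_RATE_HZ = 1000   # assumption: recording sample rate
RHYTHM_HZ = 20          # assumption: illustrative rhythm, not named in the article
WINDOW_SIZE = 250       # samples per decision window (0.25 seconds)
THRESHOLD = 0.5         # assumption: arbitrary trigger level


def rms(window: Sequence[float]) -> float:
    """Root-mean-square amplitude of one window of samples."""
    return math.sqrt(sum(x * x for x in window) / len(window))


def stimulation_decisions(samples: Sequence[float],
                          window_size: int,
                          threshold: float) -> List[bool]:
    """Return True for each non-overlapping window whose RMS exceeds the threshold."""
    decisions = []
    for start in range(0, len(samples) - window_size + 1, window_size):
        decisions.append(rms(samples[start:start + window_size]) > threshold)
    return decisions


def synthetic_recording() -> List[float]:
    """One second of a weak rhythm followed by one second of a strong one."""
    signal = []
    for n in range(2 * SAMPLE_RATE_HZ):
        amplitude = 0.2 if n < SAMPLE_RATE_HZ else 1.0
        signal.append(amplitude * math.sin(2 * math.pi * RHYTHM_HZ * n / SAMPLE_RATE_HZ))
    return signal


decisions = stimulation_decisions(synthetic_recording(), WINDOW_SIZE, THRESHOLD)
print(decisions)
```

With this synthetic signal the first four windows leave stimulation off and the last four switch it on. A real system would first have to isolate the relevant rhythm from a noisy recording and would need a threshold set for each patient.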

“It’s far short of being introduced into patients,” says Brown of the bulky
experimental system, but the demonstration does provide an important 
proof that the closed-loop concept could work for Parkinson's disease. 

In an effort to accelerate the move towards closed-loop technology, the 
US Defense Advanced Research Projects Agency (DARPA) last October 
announced a 5-year, US$70-million programme to support the devel- 
opment of novel brain stimulators. As part of the BRAIN Initiative, the 
project aims to foster brain implants to treat conditions such as post-trau- 
matic stress disorder, anxiety and traumatic brain injury. The agency is 
looking for implantable devices that can monitor and manipulate neural
activity not just at one or a few sites at a time, but across entire functional
networks of neurons. Accomplishing this goal will require the development
of new types of miniaturized sensor, as well as detailed network
models of brain function to interpret data streaming in from multiple
brain areas, says DARPA programme manager Justin Sanchez.

[Figure: In Parkinson's disease, cell death in the basal ganglia disrupts normal
brain signalling, and is thought to send neural static through the
large-scale network that controls movement (arrows; only one side
shown). Deep brain stimulation seems to repress that static.
The subthalamic nucleus (STN) is the most common target
for deep brain stimulation to treat Parkinson's disease.]

Some of those models may eventually grow out of data from research- 
ers such as Kendall Lee, a neurosurgeon at the Mayo Clinic in Rochester, 
Minnesota. At last year’s Society for Neuroscience meeting, he presented 
a prototype DBS system called Harmoni that can deliver current to one 
area of the brain while recording electrical and neurochemical responses 
elsewhere (see Nature http://doi.org/rvj; 2013). Because the brain uses 
both electrical and chemical signals to communicate, explains Kevin Ben- 
net, the lead engineer on the project, monitoring each type of data could 
provide more complete information about what is going on. The group 
intends to test Harmoni first in patients with movement disorders. But, 
ultimately, the scientists hope to extend combined chemical and electri- 
cal monitoring to psychiatric disorders. “Those will be the most difficult 
to treat,” says Bennet. “The symptoms are harder to detect and quantify.” 

Bronte-Stewart projects that testing might begin in about five years 
for the first implantable, closed-loop DBS devices for Parkinson's dis- 
ease, with psychiatric applications following close behind. It is not clear 
whether Donobedian and other current research volunteers could be 
easily upgraded to those systems; much depends on the precise design 
of the devices. But even if he does not benefit directly from the data he 
is generating, Donobedian is glad to participate. 

“Somebody had to give to me, to get this far,” he says. “If there's a 
chance for me to give something back without too much effort, I'd like 
to help.”

SEE EDITORIAL P.273


Helen Shen is a freelance writer in Mountain View, California. 
The $1,000
genome

With a unique programme, the
US government has managed
to drive the cost of genome
sequencing down towards a
much-anticipated target.

[Figure: Cost of genome sequencing compared with Moore's law.]

In Silicon Valley, Moore's law seems to stand
on equal footing with the natural 
laws codified by Isaac Newton. Intel co-founder 
Gordon Moore’ iconic observation that computing 
power tends to double — and that its price there- 
fore halves — every 2 years has held true for nearly 
50 years with only minor revision. But as an exemplar 
of rapid change, it is the target of playful abuse from 
genome researchers. 

In dozens of presentations over the past few years, 
scientists have compared the slope of Moore’s law with 
the swiftly dropping costs of DNA sequencing. For a while 
they kept pace, but since about 2007, it has not even been 
close. The price of sequencing an average human genome 
has plummeted from about US$10 million to a few thousand 
dollars in just six years. That does not just outpace Moore's 
law — it makes the once-powerful predictor of unbridled pro- 
gress look downright sedate. And just as the easy availability of 
personal computers changed the world, the breakneck pace of 
genome-technology development has revolutionized bioscience 
research. It is also set to cause seismic shifts in medicine. 
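
The gap between the two curves can be made concrete by comparing halving times. The sketch below takes the starting price of about US$10 million and the six-year span from the text, and assumes US$5,000 as a stand-in for "a few thousand dollars"; the function names are illustrative.

```python
import math


def halving_time_years(start_cost: float, end_cost: float, elapsed_years: float) -> float:
    """Time for the cost to halve, assuming a steady exponential decline."""
    halvings = math.log2(start_cost / end_cost)
    return elapsed_years / halvings


def cost_after(start_cost: float, halving_years: float, elapsed_years: float) -> float:
    """Cost after a given time if it halves once every `halving_years`."""
    return start_cost * 0.5 ** (elapsed_years / halving_years)


START_COST = 10_000_000   # about US$10 million (from the article)
END_COST = 5_000          # assumption: stand-in for "a few thousand dollars"
YEARS = 6                 # "in just six years" (from the article)

sequencing_halving = halving_time_years(START_COST, END_COST, YEARS)
moore_cost = cost_after(START_COST, 2, YEARS)   # Moore's law: halves every 2 years

print(f"Sequencing cost halved about every {sequencing_halving * 12:.1f} months")
print(f"At Moore's-law pace the genome would still cost ${moore_cost:,.0f}")
```

On these assumptions the price of a genome halved roughly every six and a half months, whereas a two-year halving time would have left it at US$1.25 million after six years.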

In the eyes of many, a fair share of the credit for this success goes 
to a grant scheme run by the US National Human Genome Research
Institute (NHGRI). Officially called the Advanced Sequencing Tech- 
nology awards, it is known more widely as the $1,000 and $100,000 
genome programmes. Started in 2004, the scheme has awarded 
grants to 97 groups of academic and industrial scientists, including 
some at every major sequencing company. 

It has encouraged mobility and cooperation among technologists, 
and helped to launch dozens of competing companies, staving off 
the stagnation that many feared would take hold after the Human 
Genome Project wrapped up in 2003. “The major companies in the 
space have really changed the way people do sequencing, and it all 
started with the NHGRI funding,” says Gina Costa, who has worked 
for five influential companies and is now a vice-president at Cypher 
Genomics, a genome-interpretation firm in San Diego, California. 


BY ERIKA CHECK HAYDEN 


[Figure annotation: As next-generation sequencers entered the market, the price dropped precipitously.]


A GIANT’S LEGACY 

The $1,000 genome programme, now close to achieving its goal, 
will award its final grants this year. As technology enthusiasts look 
to future challenges, the coming milestone raises questions about 
how the roughly $230-million government programme managed 
to achieve such success, and whether its winning formula can be
applied elsewhere. It benefited from fortuitous timing and the lack of
an entrenched industry. But Jeffery Schloss, director of the division
of genome sciences at the NHGRI in Bethesda, Maryland, who has
run the programme from its inception, says that its achievements
also suggest that there are ways to navigate public-private partnerships
successfully. “One of our challenges is to figure out what is
the right role for the government; to not get in the way, but feed the
pipeline of private-sector technology development,” he says.

[Figure annotation: The price of sequencing a whole human genome hovers around $5,000.]

The quest to sequence the first human genome was a massive
undertaking. Between 1990 and the publication of a working draft in
2001, more than 200 scientists joined forces in a $3-billion effort to read 
the roughly 3 billion bases of DNA that comprise our genetic material 
(International Human Genome Sequencing Consortium Nature 409, 
860-921; 2001). It was a grand but sobering success. The project's advo- 
cates had said that it would reveal ‘life’s instruction book’ but in fact it did 
not make it possible to interpret how the instructions encoded in DNA 
were transformed into biology. Understanding how DNA actually influ- 
ences health and disease would require studying examples of the links 
between genes and biology in thousands, perhaps millions, more people. 

The dominant technology at the time was Sanger sequencing, an 
inherently slow, labour-intensive process that works by making cop- 
ies of the DNA to be sequenced that include chemically modified and 
fluorescently tagged versions of the molecule’s building blocks. One 
company, Applied Biosystems in Foster City, California, provided the 
vast majority of the sequencers to a limited number of customers — 
generally, large government-funded laboratories — and there was little 
incentive for it to reinvent its core technology. 

Still, researchers had seen some advances, including robots that 
replaced some human work and improvements in devices capable of 
handling small amounts of liquid. At a 2002 meeting convened by the 
NHGRI, scientists predicted that such developments would drive costs
down at least 100-fold over the next five years. But that was not enough. 

They debated what price target would make human genome sequenc- 
ing routine, the kind of thing a physician might order to help diagnose a 
patient — on a par with a magnetic resonance imaging scan. “Somebody
threw out, to great rolling of eyes, ‘a thousand dollars?’” recalls Schloss.

That seemed too ambitious, given the state of the technology. “The 
risk associated with that is not one that your normal investor is willing 
to spend any money on,” says Eric Eisenstadt, a retired official from the 
US government’s Defense Advanced Research Projects Agency who is 
now a consultant in Reston, Virginia. 

So Schloss and the NHGRI stepped in and began to fund basic research 
on entirely new methods of sequencing, as well as industrial research to 
develop these technologies for commercial use. The mixture of applied 
and academic research within a single programme was uncommon at 
the National Institutes of Health (NIH), the NHGRI’s parent agency. The 
project was also more nimble than the typical NIH grant programme 
because it allowed the agency to make small awards for work considered 
promising but risky. “That flexibility is unusual for the NIH,” says Schloss. 

Furthermore, the programme provided support to sequencing com- 
panies that could compete with Applied Biosystems. One of the compa- 
nies funded in the first round of grants, 454 Life Sciences of Branford, 
Connecticut, was the brainchild of entrepreneur Jonathan Rothberg. 
It aimed to develop a method that was faster and cheaper than Sanger 
sequencing by using a much simpler sample-preparation procedure and 
running many sequencing reactions simultaneously on a solid surface. 
But as he tried to round up funding, Rothberg heard the same refrain 
over and over from investors. “People said, ‘Why would you want to 
sequence DNA fast? We've already done the Human Genome Project.’” 

A $7-million award from the NHGRI allowed the company to com- 
mercialize a technology called pyrosequencing, which was the first to 
begin chipping away at Applied Biosystems’ monopoly. 

The funding commitments also ultimately helped to convince pri- 
vate investors to enter the market. Stephen Turner, founder and chief 
technology officer of Pacific Biosciences in Menlo Park, California, says 
that his company’s 2005 NHGRI grant of $6.6 million helped to attract 
subsequent venture-capital funding. The NHGRI’s imprimatur had 
convinced investors to provide the much more substantial money that 
the company needed to commercialize its technology, which observes 
DNA synthesis as it occurs in real time. 

“Having experts in sequencing technology give us a favourable score 
was hugely influential,” says Turner. 

The NHGRI’s investments — typically a few million dollars or less 
— could not by themselves nurse a technology from lab to market. But 
they could fund parts of it, such as work on improving a dye, a piece of 
circuitry or a laser, or tests of combinations of components. 

The programme has invested $88 million in technologies based on 
nanopores and nanogaps. The form of this technology closest to the 
market involves reading bases as they are threaded through a pore (see 
Nature 456, 23-25; 2008), a method that has long promised to save costs 
and time by reading DNA while it is processed. It would negate the need 
for expensive and slow reactions to make lots of copies of the molecule. 
But solving basic issues, including how to move the DNA through the 
pore slowly enough, has been a major challenge. The NHGRI has funded 
work to overcome these hurdles — including $9.3 million given to col- 
laborators of the company now ushering the concept to market, UK- 
based Oxford Nanopore Technologies (see Nature http://doi.org/rvm; 
2014). Turner says that such investments have helped to cut sequencing 
costs before the technology hits the shelves. 

The $1,000 genome project seeded so many companies and labs that it 
populated the entire industry with expertise, say sequencing researchers. 
One of the beneficiaries of that is Illumina in San Diego, currently the 
market leader in sequencing machines. Illumina, whose technology reads 
out many short stretches of DNA, has acquired multiple companies and 
many scientists who were once supported by the NHGRI. “It’s 
through acquisitions that Illumina has become stronger and stronger,” 
says Mostafa Ronaghi, the company’s chief technology officer. 

But Schloss’s programme also forced competitors to exchange expertise 
at an annual progress meeting that has become a must-attend event. “That 
meeting is one of the most important venues for keeping an eye on what's 
happening in sequencing technology development,” says Turner. “There's 
a tremendous amount of altruistic sharing of knowledge that occurs.” 




CUT-PRICE CAVEATS 

Some scientists question certain choices made by the programme. 
“There's been a lot of money given to the nanopore space, but the objec- 
tive of nanopore really hasn't hit the mark,” says Costa, for example. 

Kevin McKernan, who worked with Costa to develop SOLiD — a 
sequencing technology based on an enzyme that joins pieces of DNA 
together — points out that many of the companies funded by the $1,000 
genome programme ultimately failed. “Their hit rate probably isn’t 
much better than a venture capitalist,” he says. 

But others give Schloss and the programme credit for spreading their 
investments over an array of academic and industrial work that is diverse 
enough to allow progress in the face of failures. Many NHGRI-funded 
firms are now defunct — including 454, and Helicos BioSciences of Cam- 
bridge, Massachusetts — but other grant recipients have moved the field 
forward, often using ideas generated by the shuttered firms. 

“The NHGRI funded smaller companies and academic groups to 
create a pipeline of technologies,” says Ronaghi. “They didn’t decide 
which technologies to bet on.” 

Working out what the $1,000 genome programme got right has 
emerged as a key question as Schloss and the NHGRI shape its successor. 
Sequencing still needs much improvement, especially in terms of qual- 
ity. For all of Sanger sequencing’s high cost, it remains the benchmark 
for accuracy. And sequencing costs are no longer dropping as quickly 
as they were a few years ago. 

But researchers are optimistic that another technology will emerge 
to challenge Illumina. Most think, in fact, that the crucial questions for 
the field will shift away from technology. Now that sequencing is cheap 
enough to talk about scanning every patient's genome, or at least the 
protein-coding portion of it, it is still not clear how that information 
will translate into improved care (see Nature http://doi.org/rvq; 2014). 
These more complex issues will require another great leap in genomic 
science — one that could make the trouncing of Moore’s law seem 
easy. SEE EDITORIAL P.273 


Erika Check Hayden writes for Nature from San Francisco, California. 
ILLUSTRATION BY DAVID PARKINS 


Industry-funded academic inventions boost innovation 


Brian D. Wright and colleagues present data challenging the assumption that 
corporate-funded academic research is less accessible and useful to others. 


Governments have long encouraged 
university–industry collaboration, 
hoping to spur innovations that 
bring jobs, investment and life-enhancing 
products’. At the same time, shrinking gov- 
ernment budgets for science have forced 
universities to look to other sources of fund- 
ing. According to the US National Science 
Foundation, in 2012, industry supplied just 
over 5% (some US$3.2 billion) of US research 
universities’ annual expenditure’. 

But the role of corporations in academic 
research is controversial. For example, when 
oil company BP announced in 2007 that 
it would pay $500 million to fund a decade 


of alternative-energy research by a consor- 
tium headed by the University of California, 
Berkeley, this prompted a backlash. Fearing 
that industry money would contaminate the 
public institution's research agenda, many stu- 
dents, staff and members of the community 
picketed the campus with a 2.5-metre Trojan 
horse. An earlier agreement between the 
department of plant and microbial biology at 
Berkeley and the Swiss pharmaceutical firm 
Novartis sparked similar opposition. At the 
1999 graduation ceremony, about 100 stu- 
dents displayed the company’s logo on their 
mortarboards, protesting that the department 
had been bought by corporate interests. 

There are reasons to be cautious about 
corporate sponsorship of academic research. 
The tobacco, food, pharmaceutical and other 
industries have been shown to manipulate 
research questions and public discourse for 
their own benefit and even to suppress unfa- 
vourable research*. And companies may 
shift university researchers towards narrow 
corporate interests. If the results of research 
are privately held, others cannot exploit them. 

Conversely, some feel that overly restrictive 
university technology-transfer policies stifle 
productive deal-making between firms and 
academic researchers’. Some advocate that 
a university's intellectual property should 
be managed by an outside agency, or else 
handed over directly to researchers or to the 
companies funding their work. 

Data to inform this debate are hard to 
come by. Individual universities may track 
patents and licences at their own institutions, 
but these data sets are generally small and 
confidential. The prevailing assumption is 
that corporate-sponsored inventions and 
the information associated with them are 
less accessible and less useful to others than 
inventions sponsored by the government or 
non-profit organizations. 

Here we offer empirical evidence to 
the contrary. Our analysis suggests that 
corporate-sponsored research is surpris- 
ingly valuable for further innovation. Data 
collected over 20 years at nine campuses 
and three national laboratories adminis- 
tered by the University of California show 
that corporate-sponsored inventions are 
licensed and cited more often than feder- 
ally sponsored ones. 

Although results might differ at other 
academic institutions, these findings should 
allay concerns that corporate sponsorship 
turns leading universities into corporate 
vassals. Collecting and combining data from 
a larger sample of institutions could help to 
both explore what corporations hope to gain 
from funding academic work, and suggest 
how universities can best manage research 
sponsorships. 


TECHNOLOGY TRANSFER 

Like most universities, the University of Cali- 
fornia requires faculty members and other 
researchers to disclose any invention that 
has commercial potential to one of its offices 
of technology transfer (OTTs), and to list 
funding sources for the project that led to it. 
Under these terms, an invention is anything 
that a researcher feels could be patented or is 
otherwise valuable as intellectual property: it 
might be a material, a method, or an animal or 
plant. The OTT then determines whether to 
pursue intellectual property protection on the 
university's behalf and negotiates contracts 
with potential licensees. 

From 1990 to 2005, University of Califor- 
nia faculty members, staff and students, and 
employees of the three associated national 
laboratories disclosed 12,516 inventions 
to their OTTs. Of these, nearly 1,500 were 
supported, at least in part, by corporate 
funds. Under strict terms of confidential- 
ity, the central OTT provided us with data 
on these disclosures, and on related licens- 
ing activities, until the end of 2010. From 
1990 to 2010 the University of California 
campuses accounted for up to 9% of total 
US academic research expenditure. Col- 
lectively, they obtained more issued patents 
than any other US academic institution. In 
lists compiled annually by the US Patent 
and Trademark Office, the multi-campus 
University of California system often 
had more than twice as many patents as 
the second-largest patent producer in 
academia (generally the Massachusetts 
Institute of Technology in Cambridge). 

Of all inventions generated at the Univer- 
sity of California, 20% are linked to at least 
one licence, and nearly 25% were eventually 
patented. Inventions with no sponsor infor- 
mation were the least likely to yield either 
licences (13%) or patents (17%). We believe 
that most of these inventions came either 
without extramural support or with federal 
support, which is such a common situa- 
tion that inventors or technology-transfer 
agents may not note it explicitly. Corporate- 
sponsored inventions resulted in licences 
(29%) and patents (35%) more frequently 
than federally sponsored ones (22% and 
26%, respectively). The rates are higher still 
for inventions with both types of sponsor; 
36% were licensed and 43% patented (see 
‘Licensed and cited’). Results were similar 
across technical fields. More than two-thirds 
of classified technologies relate to biologi- 
cal, pharmaceutical and chemical advances, 
a distribution that is consistent with other 
leading research universities (for the com- 
plete results see Supplementary information; 
go.nature.com/o99eua). 
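
The rates quoted above can be set side by side with a short calculation. The Python sketch below is illustrative only: the counts per sponsor category are those shown in the 'Licensed and cited' figure, the percentages are the rounded values given in the text, and the names `corporate_share` and `rate_ratio` are hypothetical helpers, not part of the authors' analysis.

```python
# Counts per sponsor category as shown in the 'Licensed and cited' figure.
TOTAL_INVENTIONS = 12_516
COUNTS = {
    "no_sponsor_info": 3_949,
    "federal": 5_572,
    "corporate": 887,
    "federal_and_corporate": 581,
}

# Rounded licensing and patenting rates quoted in the text (as fractions).
RATES = {
    "no_sponsor_info": {"licensed": 0.13, "patented": 0.17},
    "federal": {"licensed": 0.22, "patented": 0.26},
    "corporate": {"licensed": 0.29, "patented": 0.35},
    "federal_and_corporate": {"licensed": 0.36, "patented": 0.43},
}


def corporate_share() -> float:
    """Fraction of all disclosed inventions with any corporate funding."""
    any_corporate = COUNTS["corporate"] + COUNTS["federal_and_corporate"]
    return any_corporate / TOTAL_INVENTIONS


def rate_ratio(category: str, baseline: str, outcome: str) -> float:
    """How many times more often `category` reaches `outcome` than `baseline`."""
    return RATES[category][outcome] / RATES[baseline][outcome]


if __name__ == "__main__":
    print(f"Any corporate funding: {corporate_share():.1%}")
    for outcome in ("licensed", "patented"):
        ratio = rate_ratio("corporate", "federal", outcome)
        print(f"Corporate vs federal, {outcome}: {ratio:.2f}x")
```

With these rounded inputs, corporate-sponsored inventions come out roughly 1.3 times as likely to be licensed or patented as federally sponsored ones. The ratios inherit the rounding of the published percentages, so the supplementary information is the better source for exact values.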

Although corporate-sponsored inventions 
are more likely to be patented, that does not 
mean that corporate support makes inven- 
tions more patentable. Instead, corporations 
might select projects that are more likely to 
produce patentable inventions. 

Corporations typically get priority to nego- 
tiate licences to the inventions they sponsor, 
and 86% of the licences to the sponsors are 
exclusive, meaning that the university agrees 
not to grant the same rights to multiple licen- 
sees. Of licensed inventions associated with 
some form of intellectual property, 78% 
were licensed exclusively, consistent with the 
share of 79% reported for licensing of patents 
funded by the National Institutes of Health*. 

Nevertheless, our analysis did not sup- 
port our original assumptions that licences 
to industry-sponsored inventions would be 
likely to be exclusive, or that sponsors would 
snap up the lion’s share of exclusive licences. 
First, the overall percentage of corporate- 
sponsored inventions licensed exclusively 
(74%) is not higher than for those with solely 
public funding (76%). Second, half of the 
exclusive licences for corporate-sponsored 
inventions seem to be to third parties 
(although we cannot be sure that we identi- 
fied all the sponsor-controlled firms in the 
data). Apparently, even the inventions that 
sponsors leave on the table have substantial 
value, because these licensees usually bear 
significant costs of patenting, plus agree- 
ments to pay future royalties. 

Another surprise is that corporate- 
sponsored inventions spur more ‘knowledge 
spillovers’, on average, than federally sponsored 
research, according to forward citation 
rates, the most widely used metric for patent 
quality and value. Forward citations show 
how many times one patent is cited in sub- 
sequent patents. Each corporate-sponsored 
invention generated, on average, 12.8 forward 
citations if licensed to a third party (more if 
licensed by the sponsor), compared with 5.6 
for federally sponsored inventions. This runs 
counter to the expectation that corporate- 
sponsored inventions have narrow applica- 
tions, and so create more private benefits but 
few benefits for others. 


USING UNIVERSITIES 

This analysis does not address how 
corporate funds affect universities’ research 
agendas, but it does dispute the idea that cor- 
porations tie up all sponsored inventions to 
restrict access. Instead, high patent citation 
rates for corporate-sponsored inventions 
suggest that firms are funding exploratory 
research. Work by sociologist James Evans 
at the University of Chicago in Illinois suggests 
that corporations turn to universities to 
investigate areas outside their core strengths, 
investing in speculative science in the hope 
of finding profit opportunities. 

[Image: Microneedle fabrication, the subject of one of the most highly cited University of California patents. Credit: Stoeber, B. & Liepmann, D. J. Microelectromech. Syst. 14, 472-479 (2005)/IEEE] 

[Figure: LICENSED AND CITED. Of the 12,516 inventions logged by technology-transfer offices of the University of California system between 1990 and 2005, inventions with only federal funding were less likely to be patented or licensed than those with corporate or corporate and federal funding, and had lower patent citation rates. Panel 'Who funds inventions?' (total inventions logged): no sponsor information, 3,949; federal, 5,572; corporate, 887; federal and corporate, 581; other. Less than one-fifth of inventions have any corporate funding. Panel 'How do inventions fare?': percentage patented and licensed, split into licensed by sponsors and not licensed by sponsors; most licences taken by third parties. Source: UC Patent Tracking System database] 

In fact, Evans argues that corporations 
actually urge academics to explore further 
afield than they might otherwise. Although 
academics may act conservatively to gain 
acceptance of peers, papers and grant proposals, 
he writes, “industry partnerships draw 
high-status academics away from confirming 
established theories and towards speculation”. 

For example, the $500-million research 
grant from BP to the Berkeley-led consor- 
tium was intended to explore biofuels from 
cellulose in plants or crop residues, an area 
in which BP had virtually no expertise. 
In such cases, many resulting inventions 
might turn out to be informative to other 
researchers, but irrelevant to the firm’s 
business strategy. 

In such cases, other firms’ subsequent 
work on an invention can be more valuable 
to sponsors than exclusive access. For exam- 
ple, preliminary work by Yongdong Liu, 
a PhD candidate at Berkeley, suggests that 
information-technology company IBM 
discloses innovations on the periphery of its 
expertise without patenting them, but often 
cites non-IBM patents building on the dis- 
closed innovations. Similarly, some major 
drug companies contributed to the publicly 
funded Human Genome Project, reasoning 
that faster access to results would accelerate 
their ability to develop drugs, even if those 
results were openly available. 

Acquiring intellectual property is not 
necessarily the prime focus of corporate 
sponsors. Companies also value sustained 
relationships with leading scientists and 
associated opportunities to identify and 


recruit talented employees. The University of 
California–Novartis agreement apparently 
generated no licences for the company, and 
Novartis representatives reportedly did not 
exert any apparent influence on the selection 
of projects it funded. 

Joint federal-corporate sponsorship may 
stem from more-focused goals. We under- 
stand that they often arise from projects 
initiated by federal funding agencies, with 
corporate sponsors recruited to develop 
early, promising work into practical applications. 
For example, if a federally sponsored 
gene-screening programme finds an attrac- 
tive drug target, corporations might support 
projects to screen drug candidates against 
that target. This kind of focus would explain 
why inventions in this category are the most 
likely to be licensed (even by third parties) 
but not more highly cited. 

The large share of third-party licences 
suggests that the University of California 
successfully markets inventions and also 
negotiates agreements to keep corporations 
from locking them up unduly. This task is 
probably facilitated by the fact that many 
sponsoring firms seem to recognize that 
sharing exploratory research can be in their 
own interests. 

To assess whether these findings general- 
ize to other academic institutions, data from 
other research universities are needed. We 
advocate a project to pool similar data from 
a large sample of other research universities, 
with solid confidentiality safeguards, for 
empirical analysis. Such work could evalu- 
ate whether, for instance, groups of smaller 
or less research-oriented institutions would 
be better served by outsourcing to a single 
technology-transfer institution. 

Universities setting up contracts with 
corporations need to be vigilant in their 
mission to generate and transfer knowledge, 
but they should not assume that companies 
are focused mainly on tying up intellectual 
property. Those that do will miss fruitful 
opportunities for collaboration with firms 
willing to fund projects from which many 
others will probably benefit. 

A solar power plant on a former military air base in Finowfurt, Germany. 


Back the renewables boom 


Low-carbon technologies are getting better and cheaper each year, but continued 
public-policy support is needed to sustain progress, says Jessika E. Trancik. 


A new battery is rarely greeted with as 
much excitement as the latest smartphone 
or a new drug. The energy 
industry is widely perceived as sluggish, a 
provider of basic services and lacking crea- 
tivity. In fact, a brighter reality is emerging — 
government support for energy-technology 
development is paying off. 

Public policies to encourage the develop- 
ment and adoption of renewable-energy 
technologies are essential, because low- 
carbon performance is not visible to most 
consumers and carbon is not priced in the 
global market. Yet there is a widespread lack 
of confidence in public-sector efforts to spur 
innovation, as a result of the mixed record of 
governments in picking winners and losers 
among technologies’. 

Some governments are considering 
reducing their support for renewable-energy 
projects. The future of the US tax credit for 
new wind energy is uncertain; the United 
Kingdom is debating scaling down subsidies 
for some renewables and relaxing its targets 
for carbon-emissions reductions; and Spain 
has abandoned its incentives programme 
and electricity-price commitments for 
renewable-energy power plants. The coun- 
tries of the European Union disagree on a 
common binding target for the adoption of 
renewable energy by 2030. 

But now is not the time to cut govern- 
ment support for renewables. Each day that 
we delay implementing low-carbon energy 
technologies we increase the likelihood of 
damage from climate change — from storms 
and floods to forest fires. 

The response of the global energy industry 
to even modest policy interventions has 
been remarkable. Led by China, Europe, 
the United States and Japan, the alternative- 
energy sector is booming worldwide’. 
Solar and wind technologies have improved 
most rapidly in the past three decades, with 
photovoltaics a hundred times cheaper today 
than in 1975. 

Governments should help to maintain 
this progress. Research funds and policies 
to boost markets will mature new energy 
industries and promote the next generation 
of low-carbon technologies. 


RAPID INNOVATION 

The speed of energy-technology innovation 
is only just coming to light as long-term data 
sets become available. My analyses of 30 or 
more years of data show that the costs of 
renewable-energy technologies have fallen 
steeply. Photovoltaic module costs have 
plunged by about 10% per year over the past 
30 years and the costs of wind turbines have 
fallen by roughly 5% per year. Production 
levels for both technologies have risen by 
about 30% per year on average. 
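
To see what these annual rates imply when compounded, the following Python sketch may help. It assumes the rates hold constant over the whole 30-year period, which is a simplification of the averages quoted above; the function names are hypothetical and introduced only for illustration.

```python
def remaining_fraction(annual_decline: float, years: int) -> float:
    """Fraction of the starting cost left after a constant yearly decline."""
    return (1.0 - annual_decline) ** years


def growth_factor(annual_growth: float, years: int) -> float:
    """Multiple of the starting level reached after constant yearly growth."""
    return (1.0 + annual_growth) ** years


if __name__ == "__main__":
    years = 30
    pv = remaining_fraction(0.10, years)       # photovoltaic modules, ~10% per year
    wind = remaining_fraction(0.05, years)     # wind turbines, ~5% per year
    production = growth_factor(0.30, years)    # production levels, ~30% per year

    print(f"PV module cost: {pv:.1%} of the starting cost ({1 / pv:.0f}-fold drop)")
    print(f"Wind turbine cost: {wind:.1%} of the starting cost ({1 / wind:.1f}-fold drop)")
    print(f"Production: {production:,.0f} times the starting level")
```

Under this constant-rate assumption, a 10% annual decline leaves about 4% of the starting cost after 30 years (roughly a 24-fold drop), a 5% decline leaves about 21%, and 30% annual growth multiplies production by roughly 2,600.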

The technical advances responsible have 
been driven by public policies and industry's 
responses to them. Governments spend a 
relatively modest amount on renewable- 
energy research, roughly US$5 billion per 
year globally, which is less than one-tenth 
the amount allocated to health research. 
But government incentives are essential for 
market growth; they drive private-sector 
investments in clean-energy technologies 
of about $250 billion per year globally. 

Despite this success, lawmakers in many 
countries are questioning public support for 
clean energy. Some in the United States are 
urging that such support should be limited 
to funding basic research and development 
in universities and government labs. They 
cite the recent failure of a few prominent 
energy companies, such as Solyndra, which 
received government grants or loans in their 
early days. Critics forget that game-changing 
technologies are high-risk ventures; some 
failures are inevitable. 

[Figure: ENERGY INNOVATION. The number of new patents for energy technologies has risen rapidly in the past three decades, as public policies have driven market growth. Solar and wind installations have become much cheaper through improved designs and experience gained in building and installing them.] 

Some technologies are more open to 
improvement than others. Compact, 
modular systems, such as photovoltaics 
and electronics, are easily experimented 
on. And processes that may be achieved 
through alternative designs or materials 
offer more avenues for advancement. 
The diversity of semiconductors, for 
example, is behind the recent development 
of high-efficiency perovskite solar cells. 
Other technologies are harder to improve. 
Those with high commodity costs, such as 
coal-fired electricity, soon hit cost floors in 
the marketplace. 

Photovoltaic systems and wind turbines 
are therefore better candidates for sus- 
tained cost reduction than large nuclear or 
coal plants’. The lower price of solar cells 
today is due to greater efficiency in 
converting sunlight to electricity 
within modules, less manufacturing waste 
and greater economies of scale. 

Wind turbines have seen similar pro- 
gress, reaching higher wind speeds at greater 
heights to deliver more energy per cost of 
installed unit. Indeed, wind energy now 
competes economically with fossil-fuelled 
thermal power plants in several places, 
including Texas, and in Denmark it supplies 
30% of electricity consumption. 

Knowledge about how to design, build 
and integrate these technologies into the 
energy infrastructure has also grown. For 
example, it takes roughly half the time to 
install a solar system in Germany compared 
with the United States, thanks to a more 
experienced workforce and streamlined 
permit processes. 

The recent growth in energy patents 
reflects the increasing knowledge generated 
in laboratories and on the manufacturing 
floor. The numbers of patents for solar and 
wind technologies have risen globally by 
about 15% and 20% per year on average, 
respectively, in the past decade’ — rates 
comparable with information technologies. 
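
As a rough guide to what such growth rates mean, the Python sketch below converts an annual growth rate into a doubling time and a ten-year multiple. It assumes steady exponential growth at the quoted average rates; `doubling_time` and `decade_multiple` are hypothetical helper names, not taken from the analysis.

```python
import math


def doubling_time(annual_growth: float) -> float:
    """Years needed to double at a constant annual growth rate."""
    return math.log(2.0) / math.log(1.0 + annual_growth)


def decade_multiple(annual_growth: float) -> float:
    """Multiple of the starting level reached after ten years of growth."""
    return (1.0 + annual_growth) ** 10


if __name__ == "__main__":
    for label, rate in (("solar", 0.15), ("wind", 0.20)):
        print(
            f"{label}: doubles in about {doubling_time(rate):.1f} years, "
            f"{decade_multiple(rate):.1f}-fold over a decade"
        )
```

At about 15% per year, patent numbers double roughly every five years and quadruple over a decade; at about 20% per year, they double in under four years and grow roughly six-fold.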

A link between public investment and pat- 
ent filing is evident in historic records’. In 
the 1970s and early 1980s, patent numbers 
for solar and wind technologies rose by 
20% and 15% per year on average globally 
as public money in Japan, the United States 
and Europe was pumped in to find alterna- 
tives to oil during the energy crises. When 
global energy-research funding dropped 
by 70% between 1980 and 1990, patenting 
rates levelled off, causing concern that the 
development of renewables would slow’. 

Patenting rates remained flat for nearly 
two decades until the early 2000s, when they 
rose again (see ‘Energy innovation’). This 
time the reason was rising investment from 
industry, in response to various government 
incentives and regulations. 

Certain nations are now clear leaders in 
innovation. Japan dominates in terms of 
cumulative energy patents filed. But in the 
past ten years, growth has been driven by 
China and the United States, which together 
account for roughly 60% of renewables pat- 
ents published globally per year and 60% of 
all energy patents. China has been the front 
runner in coal patents for 15 years and, in the 
past decade, has overtaken the United States 
in annual patent numbers for wind and solar 
as well. Europe has seen a tripling of renewa- 
bles technologies patents in the past decade, 
whereas the region's patents related to fossil 
fuels have declined. 

Underlying these cost and patenting 
trends is a diversity of national or regional 
policies”, including research funding, 
market incentives such as feed-in tariffs, sub- 
sidies and regulated adoption levels. China 
has set a target to supply 15% of energy for 
electricity production with renewables by 
2020, and Germany plans to produce 35% 
of its electricity through renewables by 2020. 
In the United States, 29 states and the Dis- 
trict of Columbia have adopted renewables 
portfolio standards that require specified 
installation levels of renewable energy. 


MAINTAIN MOMENTUM 

Even with extreme energy-efficiency 
measures, developed countries will need 
to meet 75-100% of their power demand 
with carbon-free sources by 2050 (ref. 4), 
compared with 30% globally today, to hit 
emissions targets. The remainder could 
be produced with clean coal (with carbon 
capture and storage) and natural gas. 
Sustained public policies and monitor- 
ing of technology performance are needed 
to support further progress in a variety of 
low-carbon technologies for electricity, 
transport and heating until they can sup- 
port themselves. Wind and solar energy 
have delivered the greatest bang per buck 
so far, in terms of improvements in cost, 
patenting and market growth, and further 
developments should be supported. Because 
both types of power are intermittent, stor- 
age technologies should also be priorities for 
government investment. 


302 | NATURE | VOL 507 | 20 MARCH 2014 


[Figure: EVALUATING PERFORMANCE. The performance of different energy technologies in terms of average emissions and cost intensities can be compared against carbon-intensity targets (here for the United States).] 

The rapidly changing landscape of 
energy-technology patents and manufactur- 
ing costs, exemplified by the rise of China 
in these areas, cautions nations and corpo- 
rations to build experience quickly in these 
fast-moving markets. Such momentum is 
evident in the highly efficient assembly lines 
now running at Yingli Solar’s headquarters 
in Baoding, China, and in petrol stations in 
Sweden that offer drivers a choice of natu- 
ral gas, biofuels, diesel and petrol. Students 
at my university are pitching clean-energy 
ideas to start-up investors with a sophistica- 
tion that was rare even among experts ten 
years ago. 

Ways to measure progress towards low- 
carbon goals will be needed so that analysts 
and policy-makers can identify top-perform- 
ing technologies and policies. Technologi- 
cal improvements can be quantified by the 
use of a ‘cost-carbon curve’ (see ‘Evaluating 
performance’). In this way, the carbon (or 
water or land) intensity of technologies can 
be compared with their competitiveness, 
and evaluated against performance targets 
that depend on particular global or regional 
concerns such as emissions (or water 
consumption or land use). 

[Image: Wind power is on the rise in China.] 

Any major energy transformation will 
involve stumbles — from technologies that 
flop or companies that close to imperfect pol- 
icy instruments, such as the teething troubles 
with the European emissions-trading scheme. 
Encouraging a diversity of innovations and 
monitoring progress will lessen these risks. 

Governments should recognize the boom 
in energy-technology development and 
continue to support it. As history suggests, 
the resulting momentum could be enough 
to largely decarbonize the world’s energy 
supply by mid-century. ■ 


Jessika E. Trancik is an assistant professor 
in the Engineering Systems Division, 
Massachusetts Institute of Technology, 
Cambridge, Massachusetts, and at the 
Santa Fe Institute, New Mexico, USA. 
e-mail: trancik@mit.edu 
IMAGINECHINA/REX 


MAURICIO ANTON/SPL. 


HUMAN EVOLUTION 


An early human relative, Homo antecessor, takes advantage of game animals’ attraction to water (artist’s impression). 


Just add water 


Richard G. Roberts is intrigued by the idea that early 
humans gained an edge by ‘hunting’ for lakes and rivers. 


We humans are a thirsty lot. Our 
bodies are between 55% and 
60% water, and we need to drink 
2 to 3 litres of fluids per day to remain 
healthily hydrated. In the prehistoric past, 
our ancestors sourced their supplies from 
lakes, wetlands and rivers, but these amount 
to just 0.01% of all water on Earth today. 
Fresh water has always been scarce. In The 
Improbable Primate, Clive Finlayson argues 
that the progressive desiccation of the planet 
over the past few million years — especially 
three shifts around 2.8 million, 1.8 million 
and 0.8 million years ago — was the driving 
force behind our emergence. And, he posits, 
it triggered the extinction of our closest evo- 
lutionary cousins, the Neanderthals, about 
30,000 years ago. 

The field of human evolution abounds 
with schemes for how we came to be the 
dominant primate. When and why did 
we leave tropical rainforests and take our 
first steps onto the savannah? How did we 
acquire our competitive edge over other 
apes and hominins (primates more closely 
related to humans than to chimpanzees), 
enabling us to disperse around the globe and 
occupy almost every conceivable habitat? The 
evolution of our large brains, bipedal gait 
and dexterity with tools have been attributed 
to all manner of causes — most often some 
change in the external environment, but also 
transformations in cultural practices, such as 
increased meat consumption or cooking, and 
population dynamics. 

The Improbable Primate: How Water Shaped Human Evolution 
CLIVE FINLAYSON 
Oxford University Press: 2014. 

With his ‘Water Optimization 
Hypothesis’, Finlayson — a zoologist by 
training — pins his flag securely to the envi- 
ronmental mast. He blazes a trail through 
the past 16 million years of human ancestry, 
from fruit-eating apes roaming the rainforest 
canopy to Australia’s desert-dwellers, 
exemplars of human survival on the dri- 
est inhabited continent and “the crowning 
achievement of almost 2 million years of 
evolution”. Primates are experts at locating 
key resources in patchy environments, from 
ripe fruit in the rainforest to freshwater oases 
on the open landscape. Natural selection 
rewards those that can exploit these patches 
most efficiently. Thus were born the “rain 
chasers”: humans built for endurance run- 
ning and long-distance walking, equipped 
with portable and perishable objects to 
suit their nomadic lifestyle. The resulting 
increased mobility and large-scale consump- 
tion of meat by our ancestors — perhaps due 
to animal-hunting or scavenging of carcasses 
concentrated around watering holes — are 
unrivalled among primates. Hence the tag 
‘improbable’. 

Was water the evolutionary driver or is it 
merely one of the key habitat components 
that we require to survive? Answers cannot 
be sought solely in hominin fossils, which 
are exceptionally rare. So Finlayson scours 
the archaeological and palaeontological 
literature for common features of what our 
ancient forebears called home: some com- 
bination of woody cover, open spaces and 
fresh water. In 1925, the pioneering palaeo- 
anthropologist Raymond Dart proposed 
that these habitat elements were essential to 
early human evolution in southern Africa. 
Finlayson extends this list to the needs 
of later hominin lineages, adding rocky 
outcrops, which proved particularly 
popular with cool-climate Neander- 
thals occupying the mountain ranges of 
Eurasia. 

Certain regions loom large in The 
Improbable Primate as launch pads for 
nature’s experiments in human evo- 
lution: northeast Africa and Arabia 
(dubbed ‘Middle Earth’, but free of 
Hobbits), the mountain chains of main- 
land Asia and the continental shelves 
of southeast Asia, exposed when sea 
levels fell during each ice age. In this 
largely descriptive account of human 
biogeography, Finlayson paints with a 
broad brush. Defying convention, he 
lumps all hominins in the past 1.8 mil- 
lion years into a single species, Homo 
sapiens. This will be too large a lump for 
most palaeoanthropologists (and me) 
to swallow. But his focus on hominin 
lineages — rather than individual spe- 
cies — has some merit, given that genetic 
evidence has arisen in recent years of 
ancient admixture between early mod- 
ern humans, Neanderthals and Denis- 
ovans, and of gene flow into Denisovans 
from an unknown archaic hominin. 

An admirable feature of Finlayson’s 
hypothesis is its amenability to scientific 
testing. Further empirical data from fos- 
sils, artefacts and environmental records 
will certainly be valuable, as might a more 
nuanced treatment of individual sites. I 
would have welcomed more informa- 
tion about the chronology of these sites 
(which number more than 400), and the 
processes of burial, weathering and pres- 
ervation that affected them over time. 
Sites close to water and in caves can be 
overrepresented in surveys, because their 
preservation potential is often higher 
than that of desert sites. 

Ecological models are another avenue 
worth exploring. Mathematical mod- 
els that examine foraging patterns and 
responses to habitat fragmentation in 
space and time, and simulations of alter- 
native hypothetical scenarios, can help 
to illuminate what may have happened 
in the past. Such models could indicate 
which combinations of factors were likely 
to have had the greatest effect on hominin 
evolution. Multiple selective pressures 
have been in play over the past 7 million 
years and across the six inhabited con- 
tinents, so untangling these interactions 
will be no easy task. The Improbable Pri- 
mate provides a useful starting point for 
this next great challenge. ■ 


Richard G. Roberts is Australian 
Research Council Laureate Fellow and 
director of the Centre for Archaeological 
Science at the University of Wollongong, 
Australia. 

e-mail: rgrob@uow.edu.au 
Ocean currents swirl and eddy in a visualization of NASA satellite data, on show at the British Library. 


Truth is beauty 


Daniel Cressey views the British Library’s first science 
exhibition — a celebration of scientific illustration. 


Isaac Newton may have long presided 
over the British Library in the form of 
Eduardo Paolozzi’s vast sculpture at 
the library’s entrance, but only now is the 
London institution hosting its first science 
exhibition. Beautiful Science catalogues 
attempts to make sense of the world through 
visualizations from the seventeenth century 
to today, drawing on the vast archives of the 
United Kingdom’s national library. On show 
are graphics from alchemist Robert Fludd’s 
1617 work Great Chain of Being — which 
attempts to explain the Universe from stars 
to animals, vegetables and minerals — to a 
huge collection of modern Circos diagrams 
used to visualize genetic information and 
highlight relationships between species. 

“Infographics are now a staple of every 
newspaper in the country. In many ways this 
seems like a new phenomenon,” says curator 
Johanna Kieniewicz. “What I was really keen 
to show is that it actually has a very interest- 
ing and rich history.” 

The graphics have a many-layered power. 
“The visual representation of science can 
increase both the engagement of fellow 
researchers [and] the public,” says 
Kieniewicz. 


Beautiful Science: 
Picturing Data, 
Inspiring Insight 
British Library, London. 
Until 26 May 2014. 


Perhaps the exhi- 
bition’s most famous 
expression of this is 
Nightingale’s Rose, 
from the 1850s. A 
pioneer of modern medicine, Florence 
Nightingale demonstrated the value of 
improved hospital hygiene by showing that 
during the Crimean War, more British sol- 
diers died as a result of poor sanitation in 
hospital than from enemy action. The image 
she drew to illustrate this point, says Kienie- 
wicz, has “changed science, changed the way 
in which things are done”. 

Less transformative but no less impressive 
is William Farr’s failed 1848-49 attempt to 
determine the cause of a cholera epidemic. 
His huge Temperature and Mortality of Lon- 
don plots these two variables in circular 
graphs. Farr’s contemporary, John Snow, 
was more successful in using epidemiologi- 
cal mapping to pin cholera down as a water- 
borne disease; yet Farr’s diagram stands as 
a monument to the difficulties of trying to 
tease causation out of huge data sets. Farr 
eventually came around to Snow’s views, in 
part thanks to Snow’s data presentation. 

Beautiful Science shows that good data 
presentation is timeless. Witness Luke 
Howard, the meteorologist who named the 
three basic cloud types — cirrus, cumu- 
lus and stratus. His diagram of the weather 
recorded in 1815 at his home in Tottenham, 
north London, sits next to a strikingly similar 
display of weather data from 2011. The latter 
is given a modern twist through an overlay 
of social-media commentary by design firm 
Clever°Franke of Utrecht, the Netherlands. 

NASA/SVS 

As science generated more complicated 
data, the methods needed to visualize them 
also became more complicated. Evolution- 
ary theorist Jean-Baptiste Lamarck’s 1809 
table of the relationships between mammals, 
fish, reptiles, crustaceans and other animals 
can be understood easily by anyone with the 
most basic knowledge of French; much more 
parsing is needed for an avian tree of life 
published in Nature in 2012 that sits next to 
it. The exhibition reveals how seriously sci- 
entists need to take data presentation to con- 
vey meaning. Most of the pieces displayed 
were arduously plotted and coloured by 
hand. Says Kieniewicz of today’s spreadsheet 
tools for chart production: “They make it too 
easy. You don’t end up putting much thought 
into graphs.” 

Modern scientists, although liberated 
from painstaking drafting and shading, can 
gain from thinking hard about their audi- 
ence, and about precisely what and how they 
want to communicate. A graph in a research 
paper today could be at the British Library 
in years hence, forming part of what Kienie- 
wicz calls scientists’ “visual legacy”. 

Science, the exhibition reminds us, is all 
about beautiful ideas. The trick is finding the 
aesthetic that maintains both meaning and 
elegance. ■ 


Daniel Cressey is a reporter for Nature in 
London. 


Books in brief 


Governing Marine Protected Areas: Resilience through Diversity 
Peter J. S. Jones ROUTLEDGE (2014) 

Marine Protected Areas, or MPAs — ocean zones limiting human 
activity — cover little more than 2% of the world’s oceans, despite 
an internationally agreed target of 10% by 2020. And thousands of 
those that do exist are little more than ‘paper parks’, many scientists 
have found. Entering these choppy waters is geographer Peter 
Jones, who shows, through some 20 case studies, how and how 
not to govern MPAs effectively. Jones compellingly concludes that a 
diversity of incentives, from economic to social, is as essential as 
the diversity of the ecosystems MPAs are designed to protect. 


Visions of Science: Books and Readers at the Dawn of the 
Victorian Age 
James Secord OXFORD UNIVERSITY PRESS (2014) 
Angst over scientific literacy is nothing new, notes James Secord. 
The political unease and religious turmoil in early Victorian Britain 
prompted John Herschel and fellow scientific utopians to urge a 
corrective: the dissemination of ‘useful’, or scientific, knowledge 
across society. As Secord shows, access to texts such as Charles 
Babbage’s searing 1830 Reflections on the Decline of Science in 
England helped to foment an intellectual revolution in step with 
those in industry and modern science. 


The Galapagos 

Henry Nicholls PROFILE BOOKS (2014) 

From the plangent cries of blue-footed boobies to the plash of 
swimming iguanas, the Galapagos islands remain pristine — but 
for how long? In this natural and human history of Darwin’s living 
laboratory, Henry Nicholls surfs from geology, oceanography and 
marine biology to resident land species, not least the burgeoning 
population of Homo sapiens. Throughout, he intertwines key 
accounts such as Darwin’s inspired musings on geological uplift and 
the piscine encounters of pioneer diver William Beebe. One for the 
scientific islomane with a sense of the bigger picture. 


The Rise of Superman: Decoding the Science of Ultimate 
Human Performance 
Steven Kotler NEW HARVEST (2014) 
In extreme athletics, ‘redefining the possible’ is happening at 
an unprecedented rate. In this high-octane study, Steven Kotler 
explores ‘flow’, a neurochemically rich state in which cognitive and 
physiological processes mesh. The stupendous physical feats of the 
late ski-base jumper Shane McConkey and others are riveting. Equally 
surprising is what we know of flow science, such as how the brain’s 
superior frontal gyrus deactivates to speed decision-making — and 
how ‘bliss addiction’ can result from too much exposure to flow states. 


A Garden of Marvels: How We Discovered That Flowers Have Sex, 
Leaves Eat Air, and Other Secrets of Plants 

Ruth Kassinger WILLIAM MORROW (2014) 

After ‘murdering’ a kumquat tree through hard pruning, Ruth 
Kassinger set out to understand plant physiology through the 
history of botany. In this scientific wonderland, we encounter Robert 
Hooke’s exquisite drawings of plant structure in Micrographia 
(1665); arsenic-gobbling brake ferns; and the single eukaryote that, 
1.6 billion years ago, engulfed a cyanobacterium and spawned the 
multitude of chloroplasts that made life on Earth possible. Barbara Kiser 
Adapt current tools 
for handling big data 


To speed up discoveries of disease 
biomarkers and treatments, 
we must work out a cheaper 
and faster way to process, store 
and use the huge medical data 
sets that are rapidly becoming 
available (see, for example, 
Nature 506, 144-145; 2014). 

By 2015, it is likely that a typical 
hospital will create 665 terabytes 
of data a year (for comparison, 
the web archive of the US Library 
of Congress contains less than 
500 terabytes). This information 
can be used to study and analyse 
treatments — for example, for 
tuberculosis and stroke — and to 
reduce health-care costs. 

To handle such big data 
effectively (see also Nature 498, 
255-260; 2013), we need to adapt 
classical information-processing 
tools. One computational 
challenge is how to manage the 
huge volume of detailed material 
as it becomes available, without 
sacrificing information. Another 
is that the data mostly represent 
physiological processes, the 
characteristics of which change 
over time. 

Current methods are also 
inadequate for analysing 
collective information from 
different sensors, such as multi- 
dimensional descriptions from 
electroencephalography or 
magnetic resonance imaging 
of interactions between brain 
regions. 

Ervin Sejdić University of 
Pittsburgh, Pennsylvania, USA. 
esejdic@ieee.org 


Portugal’s research 
funding is secure 


As president of the Portuguese 
Foundation for Science and 
Technology (FCT), I disagree 
with André Levy’s overly 
pessimistic view of prospects for 
young scientists in the country 
(Nature 506, 159; 2014). 

The cuts in PhD studentships 
and postdoctoral fellowships that 
Levy mentions have been partially 
offset by FCT-funded PhD 
programmes and a funding boost 
for the latest round of fellowships. 
Plenty of opportunities will arise 
for recruitment and retention 
of young scientists through 
the steady funding of research 
projects and centres by the FCT, 
and from increased international 
funding through programmes 
such as the European Union's 
Horizon 2020 scheme. 

In the past two years, the 
FCT has launched initiatives to 
strengthen Portugal's research 
base: PhD programmes and 
career-development schemes; 
differentiated research grants; a 
rigorous evaluation of centres of 
excellence; and top-up incentives 
for centres that bring in European 
or non-public funding. We are 
also building a national road map 
for research infrastructure. 

The exponential growth of 
research and development in 
Portugal in the past 20-30 years is 
set to continue as the challenges 
of quality and international 
competitiveness are addressed. 
Miguel Seabra FCT, Lisbon, 
Portugal. 
presidencia@fct.pt 


Forests are more 
than sticks of carbon 


Valentin Bellassen and 
Sebastiaan Luyssaert suggest that 
forest managers should improve 
both carbon stocks and timber 
harvests to mitigate climate 
change (Nature 506, 153-155; 
2014). But forests are more than 
just sticks of carbon. 

Maximizing harvests may be 
appropriate in heavily managed 
plantations. But in the remaining 
96% of the world’s forests, it 
could conflict with other forest 
uses and ecosystem services, as 
well as with biodiversity. 

Some 1.6 billion people 
depend on forests to live. So 
optimizing wood production 
and carbon stocks at the expense 
of traditional uses and access 
rights could backfire. These 
people should not become 
carbon refugees. 
The authors recommend 
“protecting trees from animals”. 
In the African and Asian tropics 
particularly, such a move would 
have to consider the needs of 
elephants, rhinoceroses and 
ungulates, for example, to avoid 
undermining conservation 
efforts and the crucial ecosystem 
functions that these animals 
provide, such as seed and 
nutrient dispersal. 

There is also a risk that 
simplifying forests to optimize 
two ecosystem services would 
reduce forest resilience to pest 
outbreaks and droughts, and 
to climatic and environmental 
changes (R. F. Noss Conserv. Biol. 
15, 578-590; 2001). 

Promoting areas of multi-use 
forest that have various functions 
would provide environmental 
and social benefits, while 
retaining more carbon than 
smaller, single-use areas. 

Aerin L. Jacob, Sarah Jane 
Wilson McGill University, 
Montreal, Canada. 
aerin.jacob@mail.mcgill.ca 
Simon L. Lewis University 
College London, UK. 


Focus on questions, 
not hypotheses 


I contend that the insistence of 
the US National Institutes of 
Health (NIH) on hypothesis- 
driven projects in grant proposals 
could be a factor contributing to 
irreproducible research reports 
(see F. S. Collins and L. A. Tabak 
Nature 505, 612-613; 2014). 

Isaac Newton argued that 
“hypotheses ... have no place in 
experimental philosophy”, a view 
echoed by mathematician Roger 
Cotes: “Those who assume 
hypotheses as first principles of 
their speculations... may indeed 
form an ingenious romance, 
but a romance it will still be” 
(in I. B. Cohen Introduction to 
Newton’s Principia; iUniverse, 
1999). 

Such criticisms recognize the 
risk that scientists may filter 
data through their hypotheses, 
discounting results that do not 
validate the hypothesis as evidence 
that the experiment did not work 
— rather than as evidence that 
the hypothesis is false. 

© 2014 Macmillan Publishers Limited. All rights reserved 

The NIH’s funding criteria 
should instead ensure that a 
pertinent research question 
is being asked, and that the 
applicant has the means to answer 
it (see D. J. Glass Experimental 
Design for Biologists; Cold Spring 
Harbor Laboratory Press, 2006). 
David J. Glass Novartis Institutes 
for Biomedical Research, 
Cambridge, Massachusetts, USA. 
david.glass@novartis.com 


Update forensics for 
deaths in Japan 


Autopsies in Japan are performed 
on only 1.6% of all deaths, the 
lowest rate among developed 
countries. Cause of death is 
almost always determined by 
simple visual inspection, as it was 
100 years ago. Japanese forensic 
medicine needs to catch up to 
avoid murders being overlooked. 

In 2011, the National Police 
Agency announced that 
43 murders had been missed 
since 1998, with many found to 
have been disguised as suicides 
or natural deaths. 

The agency is now trying to 
carry out more autopsies, but 
examinations are often cursory 
because of budget restrictions. 
To tackle the problem, rigorous 
autopsies need to be performed 
routinely. 

In 2012, two laws relating to 
death investigations were passed, 
and the government established 
the Committee for the Promotion 
of Cause of Death Investigation, 
which published interim reports 
in May 2013. No governmental 
agency will take responsibility 
for the budget, however, so little 
progress has been made. 

Exposing the possible 
involvement of crime in 
unnatural deaths continues to be 
as hard as it ever was in Japan. 
Hiroshi Ikegaya Kyoto 
Prefectural University of 
Medicine, Kyoto, Japan. 
ikegaya@koto.kpu-m.ac.jp 


NEWS & VIEWS 


Near-Earth space shows its stripes 


Electrons in Earth’s magnetic field often exhibit a striped pattern of intensity as a function of electron energy and altitude. 
A model that factors in some unexpectedly important effects can now explain this feature. SEE LETTER P.338 


DREW L. TURNER 


Using some of the first scientific satellites 
put into orbit during the late 1950s, 
teams led by physicists James Van Allen 
in the United States and Sergei Vernov in the 
Soviet Union independently reported on 
defined regions of radiation in near-Earth 
space. These regions came to be known as 
Earth’s radiation belts, and they represent the 
first major scientific discovery of the space 
age. However, despite decades of study, many 
questions in radiation-belt physics remain 
unanswered, mostly concerning the nature of 
the inner and outer belts, which are populated 
by electrons moving at near the speed of light. 
As society becomes ever more dependent on 
satellite-based technology, it is increasingly 
important to understand the variability in the 
radiation belts, because the highest-energy 
“killer electrons” can result in potentially fatal 
damage to sensitive spacecraft electronics. On 
page 338 of this issue, Ukhorskiy et al. pre- 
sent observations and a model of a previously 
unexplained structured feature of the inner 
radiation belt, which they call zebra stripes. 
Examining early mission data from instru- 
ments for RBSPICE (the Radiation Belt Storm 
Probes Ion Composition Experiment) on 
board NASA's twin Van Allen Probes satel- 
lites, Ukhorskiy and colleagues found the 
distinct zebra-stripe pattern in plots of elec- 
tron intensity as a function of electron energy 
and equatorial radial distance, an example of 
which is shown in Figure 1a. Such a pattern 
had been inferred in observations from the 
low-altitude Demeter spacecraft and was 
thought to be caused by interactions between 
inner-belt electrons and ultralow-frequency 
plasma waves during geomagnetic storms — 
active periods of geomagnetic activity. 
However, with the new observations from 
RBSPICE, which observed the stripes on 
most orbital passes by the satellites, it became 
clear that the stripes are a regular feature of 
the inner radiation belt. They were observed 
within an altitude of about 13,000 kilometres 
at low geomagnetic latitudes (latitudes with 
respect to Earth’s magnetic axis), even during 
quiet geomagnetic conditions. Contradictory 
to the previous hypothesis, the stripes are actu- 
ally more coherent and stable during periods of 
low geomagnetic activity, inspiring Ukhorskiy 
et al. to re-evaluate how these structured fea- 
tures might form. 

Figure 1 | Zebra-stripe patterns. a, An example of observations of zebra-stripe patterns in the intensity 
of electrons in Earth’s inner radiation belt as a function of electron energy (in kiloelectronvolts) and 
distance from Earth (expressed in Earth radii from the centre of the planet). The observations were made 
by the RBSPICE instruments on board the twin Van Allen Probes satellites. Intensity is shown in units 
of electron number (counts) per square centimetre per second per steradian per keV. b, Ukhorskiy et al. 
reproduce these observations using a model in which drifting electrons in an Earth-like magnetic field 
interact resonantly with an electric field induced by a tilt between the planet’s magnetic moment and its 
axis of rotation. (Plots modified from Fig. 1 of the paper.) 

The dynamics of energetic charged parti- 
cles in Earth’s magnetic field, such as electrons 
and protons, are dominated by a combina- 
tion of electric and magnetic forces called 
the Lorentz force, and, because of the mainly 
dipolar nature of the field, radiation-belt elec- 
trons can become trapped in much the same 
manner as in the classical field arrangement 
known as a magnetic ‘bottle’. The trapped par- 
ticles undergo three types of periodic motion: 
gyration around magnetic-field lines; bounce 
motion along field lines, potentially from pole 
to pole; and drift motion perpendicular to field 
lines in the azimuth (longitudinally) around 
the planet, owing to forces associated with the 
gradient and curvature of the magnetic field. 
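
For reference, the Lorentz force named above can be written out explicitly. The symbols are standard notation introduced here for illustration rather than taken from the article: q is the particle's charge, v its velocity, and E and B the electric and magnetic fields. The second relation states the usual ordering of the three periodic motions for trapped radiation-belt particles, with gyration the fastest and azimuthal drift the slowest; the labels for the gyration, bounce and drift periods are likewise assumed.

```latex
\mathbf{F} = q\left(\mathbf{E} + \mathbf{v} \times \mathbf{B}\right),
\qquad
\tau_{\mathrm{gyration}} \ll \tau_{\mathrm{bounce}} \ll \tau_{\mathrm{drift}}
```

Because the magnetic part of the force is always perpendicular to v, it changes a particle's direction but not its kinetic energy, so any change in energy must come from the electric field.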

Furthermore, because Earth’s magnetic 
moment is not parallel to its axis of rotation, 
an induced electric field is also generated. 
This oscillating electric field was thought to 
be unimportant for radiation-belt dynamics 
at Earth, because the kinetic energy of radia- 
tion-belt particles is many orders of magnitude 
greater than that imposed by the electric field. 
However, Ukhorskiy and colleagues speculate 
that, because Earth’s rotation is independent 
of geomagnetic activity, the electric field 
might have a role in the formation of the 
zebra stripes. 

The researchers used a model that simulates 
the dynamics of test particles in an Earth-like 
magnetic field and that includes the induced, 
oscillating electric field. The model shows 
that initially smooth distributions of electrons 
evolve into zebra-stripe patterns (Fig. 1b). Such 
patterns form because of an interaction (phase 
resonance) between the drift motions of elec- 
trons, which are related to their energies and 
radial locations, and the diurnal oscillations of 
the induced electric field. In this interaction, 
the electrons undergo a phenomenon called 
drift phase mixing, which affects their energy 
and radial location, resulting in distinct peaks 
and troughs in electron intensity as a function 
of energy and radial location. Thus, the zebra 
stripes in the observed electron intensities are 
simply manifestations of the electron distribu- 
tions produced by such drift phase mixing. 
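
The general idea of phase mixing can be illustrated with a toy calculation. This is a minimal sketch and not the model of Ukhorskiy and colleagues: it assumes that all electrons begin with the same drift phase, that the drift angular frequency is simply proportional to energy multiplied by radial distance (a non-relativistic dipole approximation), and that the intensity seen at a fixed longitude varies as the cosine of the accumulated phase. The constant K, the energy range and all function names are hypothetical, chosen only for illustration.

```python
import math

# Hypothetical constant (radians per hour per keV per Earth radius).
# Chosen only to give a handful of stripes per day; not a measured value.
K = 0.002


def drift_frequency(energy_kev, radial_distance_re):
    """Toy drift angular frequency, assumed proportional to energy x distance."""
    return K * energy_kev * radial_distance_re


def intensity(energy_kev, radial_distance_re, t_hours, amplitude=0.3):
    """Relative intensity at a fixed longitude after t_hours of drifting."""
    phase = drift_frequency(energy_kev, radial_distance_re) * t_hours
    return 1.0 + amplitude * math.cos(phase)


def count_peaks(values):
    """Count strict local maxima in a sequence of numbers."""
    return sum(
        1
        for i in range(1, len(values) - 1)
        if values[i - 1] < values[i] > values[i + 1]
    )


def stripes_at(radial_distance_re, t_hours, energies):
    """Number of intensity peaks across the energy range at one distance."""
    profile = [intensity(e, radial_distance_re, t_hours) for e in energies]
    return count_peaks(profile)


# Energies from 100 to 600 keV in 2-keV steps (an illustrative range).
ENERGIES = [100 + 2 * i for i in range(251)]

if __name__ == "__main__":
    for hours in (0.0, 24.0, 48.0):
        print(hours, stripes_at(2.0, hours, ENERGIES))
```

In this sketch the energy profile is flat at the start and develops more peaks as time passes, because electrons of neighbouring energies drift at slightly different rates and slip progressively out of phase with one another.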

Ukhorskiy and colleagues’ model is rela- 
tively simple, yet it simulates the zebra stripes 
remarkably well during quiet geomagnetic 
conditions. The authors also demonstrate 
that the model can be modified to explain 
how the zebra-stripe patterns are disrupted 
by perturbations during geomagnetic storms. 


This new understanding will compel space 
physicists to re-evaluate their concept of radi- 
ation-belt dynamics and its finer complexi- 
ties. The results also shed light on the nature 
of radiation-belt dynamics in other planetary 
systems, such as those of Jupiter, Neptune and 
Uranus, because these planets all have a fast 
rotation period and significant tilt between the 
magnetic moment and axis of rotation (unlike 
Saturn). We can look forward to testing the 
model using observations for these systems 
from future missions. ■ 


Drew L. Turner is in the Department 
of Earth, Planetary, and Space Sciences, 
University of California, Los Angeles, 
Los Angeles, California 90095-1567, USA. 
e-mail: dturner@igpp.ucla.edu 
Closing the distance 
on obesity culprits 


Genetic variation in a non-protein-coding region of the gene FTO is implicated in 
obesity. A study finds evidence that, rather than affecting FTO itself, variations 
in this region influence expression of a distant gene, IRX3. SEE LETTER P.371 


DAVID U. GORKIN & BING REN 


Genome-wide association studies have 
identified more than 75 locations in 
the human genome at which changes in 
DNA sequence are linked to obesity and traits 
closely related to the condition, such as body 
mass index. In these studies, single nucleotide 
changes in a non-protein-coding region (an 
intron) of the gene FTO often have the strong- 
est association with obesity. It is generally 
thought that this association arises because of a 
genetic defect that alters the function or expres- 
sion level of FTO, which encodes an enzyme 
known to be involved in the control of body 
weight and metabolism in mice. However, evi- 
dence of a direct link between the single nucleo- 
tide changes and alteration of FTO function or 
expression level has been lacking. In this issue, 
Smemo et al. (page 371) present compelling 
evidence that the association between obesity 
and FTO actually involves another gene called 
IRX3, which encodes a transcription factor 
involved in multiple developmental processes. 
IRX3 is located a long way — roughly 
500 kilobases — from FTO’s first intron, in 
which the single nucleotide changes (also 
known as single nucleotide polymorphisms, 
or SNPs) associated with obesity are located. 
Despite previous research suggesting that 
long-range enhancer sequences in FTO’s 
first intron might regulate the expression of 
IRX3, attention has continued to focus on 
FTO, partly owing to a lack of more-definitive 
evidence linking these enhancers to IRX3 
expression. Now, Smemo and colleagues have 
confirmed and expanded on this link. 
Verifying a functional relationship between 
an enhancer sequence and a distant target gene 
requires a demonstration that the enhancer 
interacts physically with the target gene’s 
promoter in the cell nucleus (a promoter is a 
DNA sequence immediately adjacent to the 
beginning of a gene that is involved in regu- 
lating transcription). Smemo and colleagues 
investigated long-range interactions between 
FTO and IRX3 using a technique called chro- 
matin conformation capture, which meas- 
ures how frequently different regions of the 
genome interact. The authors found that the 
promoter of IRX3 physically interacts with the 
first intron of FTO (Fig. 1). This interaction was 
found in mouse and zebrafish embryos, adult 
mouse brains and human cell lines, suggest- 
ing that interaction between these two regu- 
latory regions is an evolutionarily conserved 
feature of genome organization. The authors 
also found that sequences in the first intron of 
FTO show enhancer activity in several of the 
tissues in which IRX3 is expressed, and that the 
promoter of IRX3 alone cannot account for the 
gene's observed expression pattern, indicating 
that IRX3 relies on long-range regulatory input. 

Smemo and co-workers reasoned that, if the 
association between obesity and FTO does result
from genetic alteration of an IRX3 enhancer, 
then the same SNPs linked with obesity should 
also be associated with changes in IRX3 expres- 
sion. To investigate this, the authors examined 
IRX3 expression in 153 human brain samples. 
Obesity-linked SNPs were associated with 
IRX3 expression in these samples, but not 
with expression of FTO, directly linking these 
variants to IRX3 regulation. 

In a final, but crucial, set of experiments, 
Smemo et al. found that mice lacking a 
functional copy of Irx3 have significantly lower 
body weight than control animals, along with 
a higher base metabolic rate, less body fat and 
a predisposition to form brown over white 
adipose tissue (the latter being more likely to 
accumulate in obese individuals). This clearly 
demonstrates a previously unappreciated 
role for Irx3 in controlling body weight. Fur- 
thermore, the authors report that these same 
traits are also seen in mice when a ‘dominant- 
negative’ mutant form of Irx3, which interferes 
with the function of normal Irx3, is expressed 
in the hypothalamus, a region of the brain 
that has a key role in regulating food intake 
and metabolism. These data indicate that con- 
trol of body weight relies specifically on Irx3 
activity in the hypothalamus. 

Smemo and colleagues’ data suggest a model
in which a particular genetic variant (or variants)
in the first intron of FTO disrupts an
enhancer of IRX3, thus altering IRX3 expression
and, in turn, influencing body weight.
However, important questions remain. First,
what is the true functional variant (or variants)
that causes the association between this region
and obesity? Although many SNPs identified
in this region have been associated with
obesity, it is likely that these associations reflect
a single genetic defect in the region. Without
pinpointing this defect, it will be difficult to
determine whether the genetic mechanism
involves disrupted regulation of FTO, IRX3 or
perhaps both. Second, what tissues are most
relevant to this association? The current study
does an admirable job of examining appropriate
tissues wherever possible, but there
are missing links. For example, the authors
observed that loss of Irx3 function in the hypothalamus
causes changes in body weight in
mice, but the human brain samples were from
the cerebellum. Additional studies are needed
to answer these and other questions about the
connection between obesity, FTO and IRX3.

Figure 1 | Long-range control of gene expression. Smemo et al. have demonstrated that a
non-protein-coding region of the gene FTO, which has been linked to obesity, physically interacts with
the promoter sequence required for expression of the IRX3 gene, located around 500 kilobases (kb) away.
The authors find that sequences within this non-protein-coding region of FTO act as enhancer elements,
regulating expression of the distant IRX3.

The implications of Smemo and co-workers’
study extend far beyond obesity. SNPs associ- 
ated with other human traits frequently occur 
at or near transcriptional enhancers, and linear
genomic distance is often an unreliable predictor
of enhancer-promoter interactions. The
connection between IRX3 and FTO is particu- 
larly instructive because the enhancers of one 
gene reside within an intron of the other. How- 
ever, this is not an isolated case. For example, 
an enhancer in an intron of the gene LMBR1 
regulates the developmental gene SHH located 
more than 1 megabase away, and mutations in 
this enhancer can cause limb malformations 
owing to altered SHH expression. Similarly, an
enhancer within an intron of the gene HERC2 
contains a SNP that influences human eye, hair 
and skin colour by modulating the expression 
of a neighbouring gene called OCA2 (ref. 13). 
Through their study, Smemo et al. provide an 
excellent example of how detailed functional 
analyses, particularly those that link enhanc- 
ers to their target genes, can help to evaluate 
the biological consequences of variation in 
non-coding sequences.


David U. Gorkin and Bing Ren are at
in the Department of Cellular and Molecular
Medicine, University of California, La Jolla, USA. 
e-mails: dgorkin@gmail.com; biren@ucsd.edu 
Photonics illuminates
the future of radar 


The first implementation of a fully photonics-based coherent radar system shows 
how photonic methods for radio-frequency signal generation and measurement 
may facilitate the development of software-defined radar systems. SEE LETTER P.341 


JASON D. MCKINNEY 


For well over a decade, there has been
intense pursuit of photonic techniques
for the synthesis and processing of
broadband radio-frequency signals in appli- 
cations ranging from wireless communications 
to radar. Such work is motivated by one main 
challenge: the lack of electronic systems capa- 
ble of directly generating, processing and digi- 
tizing signals that have high central frequencies 
and large bandwidths while maintaining sig- 
nal fidelity (minimal spurious-signal content 
and low noise levels). That challenge is sure to 
remain, given the escalating interest in higher 
centre frequencies and greater signal agility for 
software-defined radio and radar architectures 
— those in which the desired signal can be 
rapidly changed under software control. 
Such signal agility could offer, for example, 
increased radar resolution. On page 341 of this 
issue, Ghelfi et al. describe the first field demonstration
of a coherent radar system based on
photonic techniques. Their work shows that 
such techniques may indeed provide capability 
for next-generation radar systems. 

Coherent radar systems use both the amplitude
and phase (frequency) of the radar sig-
nal that is reflected from a given target (the 
radar return) to provide range and velocity 
data for the target. In such a system, a radar 
waveform of low centre frequency is designed 
— that is, its bandwidth is chosen — to give 
the desired range resolution. This waveform 
is then shifted to a higher centre frequency 
by multiplication with a stable, continuous- 
wave radio-frequency (RF) signal (carrier), 
and subsequently amplified and transmitted 
to the target. On reaching the radar receiver, 
the waveform is shifted to a lower, intermediate 
frequency, again through multiplication with 
a stable RF signal, and processed to extract the 
target’s range and velocity. 

Information on range is acquired from the 
time of arrival of the radar return. The range 
resolution is determined by the waveform’s 
bandwidth-limited duration, and improves as 
this duration decreases. The need for increased 
resolution requires the use of higher carrier 
frequencies, which are necessary to obtain 
waveforms of shorter duration. Velocity data 
are obtained by comparing the frequency 




310 | NATURE | VOL 507 | 20 MARCH 2014 


© 2014 Macmillan Publishers Limited. All rights reserved 


of the return waveform against a stable, 
continuous-wave reference signal of the same 
frequency as that of the original radar wave- 
form. For a moving target, the return waveform 
will be shifted in frequency relative to its origi- 
nal frequency owing to the Doppler effect (at 
acoustic frequencies, this effect is analogous 
to the way in which the perceived pitch of a 
Figure 1 | A photonics-based radar system. At the
heart of the radar transmitter–receiver (transceiver)
implemented by Ghelfi et al. is a mode-locked laser
(MLL). This laser is connected by optical fibres (blue 
arrows) to a photonics-based radio-frequency (RF) 
generator, in which the laser output is first optically 
modulated and then converted to electronic form 
to obtain a radar signal. The MLL is also connected 
to a photonics-based analog-to-digital converter 
(ADC). The generator’s output is connected to 

an antenna by an RF cable (left red arrow) and 
transmitted towards a target. The signal reflected 
from the target (radar return) is collected by the 
antenna and directed by means of the RF cable 
(right red arrow) to the ADC. The ADC uses a train 
of pulses from the MLL to optically measure the 
amplitude of the reflected signal at discrete instances 
in time with high precision (optical sampling). The 
resulting optical samples are converted to digital 
electronic form and transmitted (black arrow) to 
the digital-signal processor of a computer. (Graphic 
adapted from Fig. 2 of the paper.)


siren varies as it passes an observer). Accurate 
velocity measurement demands that the fre- 
quency of both the carrier and the reference 
signals be known with a high degree of cer- 
tainty — that is, that their phase noise is low. 
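
The range and velocity relations described above can be sketched numerically. The formulas used here are the standard textbook ones (range from round-trip delay, range resolution c/2B for a waveform of bandwidth B, and the two-way Doppler shift 2vf/c); they are not taken from Ghelfi and colleagues' paper, and the function names and example values are purely illustrative assumptions.

```python
# Minimal sketch of textbook radar relations; names and values are illustrative.
C = 299_792_458.0  # speed of light in vacuum, m/s


def target_range(round_trip_delay_s):
    """Range from the time of arrival of the radar return (two-way path)."""
    return C * round_trip_delay_s / 2.0


def range_resolution(bandwidth_hz):
    """Smallest resolvable range difference for a waveform of given bandwidth."""
    return C / (2.0 * bandwidth_hz)


def doppler_shift(radial_velocity_ms, carrier_hz):
    """Two-way Doppler shift for a target closing at radial_velocity_ms."""
    return 2.0 * radial_velocity_ms * carrier_hz / C


def radial_velocity(doppler_hz, carrier_hz):
    """Invert the Doppler relation to recover the target's radial velocity."""
    return doppler_hz * C / (2.0 * carrier_hz)


if __name__ == "__main__":
    # Hypothetical numbers: 100-microsecond delay, 200-MHz bandwidth,
    # 10-GHz carrier, target approaching at 100 m/s.
    print(target_range(100e-6))
    print(range_resolution(200e6))
    print(doppler_shift(100.0, 10e9))
```

Because the Doppler shift scales with the carrier frequency, any uncertainty in the carrier or reference frequency feeds directly into the velocity estimate, which is why low phase noise matters.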

In a digital coherent radar, processing of 
the radar return is accomplished using a com- 
puter. Such processing requires the use of a 
high-precision analog-to-digital converter 
(ADC), which measures the amplitude of the 
return waveform at discrete instances in time 
and converts these measurements (samples) 
to digital data suitable for computer process- 
ing. This digitization requires that the time 
at which each sample is taken in the ADC is 
known to high accuracy, which in turn neces- 
sitates that the timing jitter — the variation 
in the time between successive samples — 
is minimal. Low phase noise and minimal 
timing jitter are intimately related: timing 
jitter is directly proportional to the phase noise 
mathematically integrated in frequency. Minimizing
the phase noise therefore minimizes
the timing jitter. 
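
As a rough illustration of this link, the sketch below converts a phase-noise level into a root-mean-square timing jitter using the commonly quoted relation: integrate the single-sideband phase noise over offset frequency, double it, take the square root and divide by 2πf. A flat phase-noise floor is assumed so that the integral is trivial; real oscillators have frequency-dependent spectra. The jitter-limited signal-to-noise formula, the function names and all numerical values are assumptions for illustration and do not describe the measured system.

```python
import math


def rms_jitter_flat(phase_noise_dbc_hz, f_lo_hz, f_hi_hz, carrier_hz):
    """RMS timing jitter (s) for a flat single-sideband phase-noise floor.

    Assumes the phase noise is constant between the offsets f_lo_hz and
    f_hi_hz, which is a simplification made only for this sketch.
    """
    ssb_linear = 10.0 ** (phase_noise_dbc_hz / 10.0)     # dBc/Hz -> 1/Hz
    phase_var = 2.0 * ssb_linear * (f_hi_hz - f_lo_hz)   # rad^2, both sidebands
    return math.sqrt(phase_var) / (2.0 * math.pi * carrier_hz)


def jitter_limited_snr_db(signal_hz, rms_jitter_s):
    """Upper bound on sampling SNR (dB) set by timing jitter alone."""
    return -20.0 * math.log10(2.0 * math.pi * signal_hz * rms_jitter_s)


if __name__ == "__main__":
    # Hypothetical oscillator: -120 dBc/Hz from 1 kHz to 10 MHz at 40 GHz.
    jitter = rms_jitter_flat(-120.0, 1e3, 1e7, 40e9)
    print(jitter)
    print(jitter_limited_snr_db(40e9, jitter))
```

In this simple model, halving the jitter raises the jitter-limited signal-to-noise ratio by about 6 dB, so a low-jitter sampling clock pays off most for high-frequency inputs.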

Enter the mode-locked laser — a class of 
laser in which the optical modes of oscillation 
maintain a highly stable phase relationship 
(hence the term mode-locked). This phase sta- 
bility results in a periodic train of laser pulses 
exhibiting low timing jitter. When two optical 
modes are selected by an optical filter placed 
beyond the laser and subsequently directed 
to a light sensor known as a photodiode, the 
sensor's electrical output is an RF signal with 
low phase noise. The frequency of this signal 
is readily tuned on the basis of the two optical 
modes that are selected, and the efficiency with 
which the RF signal is created is determined 
by the electrical response of the photodiode 
at the chosen frequency. For coherent-radar 
applications, such a laser provides an appeal- 
ing tool that can both generate tunable low- 
phase-noise RF signals and serve as the basis 
for a low-jitter ADC. This is precisely what 
Ghelfi et al. have done to build a coherent radar 
system as part of a project called PHODIR 
(‘Photonics-based fully digital radar’). 
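
The tunability described above follows from the comb-like spectrum of a mode-locked laser: adjacent optical modes are separated by the pulse repetition rate, so two modes that are k mode spacings apart beat on the photodiode at k times that rate. The sketch below shows this arithmetic. The 400-MHz repetition rate and the function names are assumptions chosen for illustration, not values taken from the text.

```python
def beat_frequency(rep_rate_hz, mode_a, mode_b):
    """RF beat note produced when two comb modes hit a photodiode."""
    return abs(mode_a - mode_b) * rep_rate_hz


def mode_separation_for(target_hz, rep_rate_hz):
    """Nearest whole number of mode spacings that yields the target RF."""
    return round(target_hz / rep_rate_hz)


if __name__ == "__main__":
    rep_rate = 400e6  # assumed repetition rate, Hz (illustrative only)
    for target in (10e9, 40e9):
        k = mode_separation_for(target, rep_rate)
        # Mode indices are relative; only their difference matters here.
        print(k, beat_frequency(rep_rate, 0, k))
```

Retuning the carrier then amounts to moving the optical filter to a different pair of modes; the available frequencies are restricted to integer multiples of the repetition rate.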

In this project, a single mode-locked laser is 
used to generate the required continuous-wave 
RF signals and to sample the return waveforms 
in an optical ADC. Such operations are at the 
centre of the project’s photonics-based radar 
transmitter–receiver, or transceiver (Fig. 1).
Laboratory measurements of the resulting 
continuous-wave signals showed that signals 
with a frequency near 40 gigahertz exhibit 
integrated phase noise and timing jitter that 
are approximately half those of state-of-the-art 
conventional RF synthesizers. More impres- 
sively, because the timing jitter of the laser- 
pulse train is about ten times lower than that of 
the best electronic ADCs, the authors achieved 
record-precision digitization of continuous- 
wave signals at 40 GHz. This optical ADC has 
about 100 times better digitization fidelity than 
its best electronic equivalent. And the device 
showed comparable digitization fidelity to that
of similar photonic ADCs over a bandwidth
that was ten times larger — a remarkable result. 

In a first-of-its-kind demonstration, Ghelfi
and colleagues next incorporated their pho- 
tonics-based transceiver into a coherent radar 
architecture operating at a carrier frequency 
of about 10 GHz, and tested it in a real-world 
situation by measuring the take-off trajectories 
of several aircraft. They compared the resulting 
range and velocity data against publicly avail- 
able trajectory data from independent sources 
and found excellent agreement between the 
two data sets. The results illustrate that a 
photonics-based transceiver can be success- 
fully used in a radar system. 

The individual performances of the 
system's transmitter and ADC are world-class 
with respect to those of other photonics-based 
devices. Furthermore, the readily tunable 
carrier and reference signals, as well as the 
photonic ADC, remove much of the complex- 
ity of frequency translation that is inherent in 
all-electronic radar implementations. Cer- 
tainly, these elements, and their combination 
into a functional system, are appealing com- 
ponents for future frequency-agile, software- 
defined radar architectures. 

But an important question must be 
addressed: can photonics-based radars achieve 
similar or better performance than their all- 
electronic counterparts? Answering this ques- 
tion will require direct comparison of photonic 
and all-electronic versions of the same radar. 
Compared with the system described here, 
current commercial digital radar systems 
operating at a 10-GHz carrier frequency show 
substantially higher dynamic range (the range 
of received-signal power over which the radar 
receiver responds linearly). Because dynamic 
range translates directly into radar sensitivity 
and false-target detection, a high value for this 
quantity is crucial. 

Another concern is one of radar range, the 
distance over which the radar can successfully 
detect a target. Increased range requires longer 
processing time, placing more stringent limits 
on long-term timing jitter and phase noise at 
a low frequency offset from the carrier signal 
— quantities that are difficult to minimize in
photonics-based architectures. 

Ghelfi and colleagues’ study raises yet other 
questions. In what types of radar system could 
photonics have the most impact? Given a par- 
ticular radar application, what requirements 
(associated with, for example, timing jitter and 
phase noise) are placed on our techniques, and 
can they be achieved? Pursuing the answers 
to these questions will no doubt lead to excit- 
ing — and challenging — research in the years 
to come.
The disassembly of death


During the cell-death program known as apoptosis, cells break up into 
membrane-bound fragments. It emerges that this process is controlled by the 
protein pannexin 1 and can be deregulated by an antibiotic. SEE ARTICLE P.329


CHRISTOPHER D. GREGORY 


Cell death is a feature of normal life.
Around the clock, cells are disassembled
by an evolutionarily conserved
controlled-death program called apoptosis,
which ensures proper regulation of the size and 
quality of cell populations in tissues. Much is 
known about the molecular mechanisms that 
trigger this program, but it is unclear how 
dying cells are dismantled in a controlled 
manner once apoptosis is under way. On 
page 329 of this issue, Poon et al. (ref. 1) demonstrate
a central role for a cell-membrane channel,
the pannexin 1 protein, in inhibiting cell
disassembly during apoptosis, and report that, 
unexpectedly, a member of a commonly pre- 
scribed class of antibiotic is able to modulate 
the activity of such channels. 

Pannexin 1 (PANX1) channels are complex 
structures that span the plasma membrane of 
cells, regulating the movement of small mol- 
ecules of various sizes between the intracellular 
and extracellular milieux. These channels are 
best known for controlling the release of ATP, 
a multifunctional energy-carrying and signalling
molecule. It was previously shown that
PANX1 channels seemingly open up to release 
ATP from the cell during apoptosis. This release 
acts as a ‘find-me’ signal for phagocytes — cells
that seek out, engulf and digest apoptotic cells 
and their fragments (Fig. 1a). In addition to 
ATP, PANX1 channels are likely to control the 
efflux and influx of a wide variety of bioactive 
molecules from many cell types in various states 
of activation and differentiation, as well as cell 
death. Conveniently, PANX1-channel opening
can be monitored using surrogate markers,
because fluorescent DNA-binding dyes such 
as TO-PRO-3 are among the molecules that 
selectively enter cells through these channels.

In their groundbreaking paper, Poon and 
colleagues demonstrate that rigorous molec- 
ular screening, when coupled with the power 
of observation in a well-regulated setting, can 
uncover surprising and invaluable findings. 


The authors designed an assay in which uptake 
of TO-PRO-3 into apoptotic human cells was 
monitored as an indicator of PANX1-channel 
activity. They subjected this assay to a widely 
available library of 1,280 small molecules, 
including the quinolone-based antibiotic, 
trovafloxacin. Unexpectedly, trovafloxacin 
proved to be a potent and direct pharmacologi- 
cal inhibitor of PANX1 channels. The authors 
demonstrated that this role was independent 
of apoptosis, because trovafloxacin was able 
to inhibit individual, non-apoptotic channels. 


Poon and co-workers went on to study the 
effects of trovafloxacin on apoptotic cells and 
discovered that PANX1 channels not only 
regulate ATP release during apoptosis, but 
also control the formation of apoptotic bod- 
ies, membrane-bound cell fragments that 
bud off from the cell during apoptosis. Inhi- 
bition of normal PANX1-channel function 
with trovafloxacin resulted in a large increase 
in the number of apoptotic bodies formed, 
causing dysregulated fragmentation of the 
cell (Fig. 1b). The authors also found that 
active PANX1 channels prevent the forma- 
tion of string-like structures (which they term 
apoptopodia) that arise during inhibition of 
channel activity in apoptotic cells. Somehow 
apoptopodia prevent the detachment of blebs 
— irregular bulges in the plasma membrane — 
that commonly appear at cell surfaces during 
the early stages of apoptosis. 

The purpose of apoptotic bodies is currently 
unknown. Each apoptotic body has a diverse 
mixture of components, and may contain 
DNA, fragments of the cell nucleus, other 
intact organelles and a wide array of intra- 
cellular proteins and other molecules, all 
encapsulated by portions of the membrane 
that previously surrounded the living cell. 
Although the bodies are rapidly ‘gobbled up’ 
by phagocytes, whole dying cells can also be 
engulfed effectively; in fact, not all apoptotic 
cells break up. Perhaps apoptotic bodies are 
Figure 1 | PANX1 channels regulate cell disassembly. a, PANX1 channels control the passage of
molecules across the membrane in viable cells (green arrows). When cells die by apoptosis, membrane- 
bound fragments called apoptotic bodies bud off from the cell. PANX1 channels release ATP molecules 
from the cell and allow entry of selected molecules such as the fluorescent dye TO-PRO-3 (blue arrows). 
ATP acts as a ‘find-me’ signal to phagocytes that engulf both the dying cell and the apoptotic bodies. b, Poon
et al. (ref. 1) report that PANX1-channel function is inhibited by the antibiotic trovafloxacin, which modulates the
channel’s activity in viable cells (red arrows). Channel inhibition also results in the formation of many

more apoptotic bodies during apoptosis, leading to dysregulated fragmentation of the cell. The study 
reveals that apoptotic bodies form in association with string-like structures, termed apoptopodia. 
more efficiently degraded than whole apoptotic
cells within the phagocyte, or maybe they
are targeted or degraded differently. Apoptotic 
bodies may also be important in controlling 
immune responses to apoptotic cells (ref. 5).

It is possible that apoptotic bodies retain 
biological functions and act at sites well away 
from the dying cell. Given that apoptosis can 
drive cell regeneration in tissues (ref. 6) — an attractive
replacement of the old with the new — it
is particularly tempting to suggest that apop- 
totic-cell-derived bodies transport active com- 
ponents that contribute to these regenerative 
processes. Small vesicles have been found by 
others to export bioactive cargoes from non-apoptotic
cells in response to various stimuli (ref. 7).
Although such vesicles are generally smaller 
than apoptotic bodies, the two types have 
certain structural characteristics in common. 
Exactly how PANX1 channels contribute to 
the functional (including possible pathologi- 
cal) attributes of apoptotic bodies, and perhaps 
other extracellular vesicles, will be valuable 
avenues of future investigation. 

Poon and colleagues’ work raises an issue 
of relevance to improved antibiotic design. 
Trovafloxacin, a highly effective, broad- 
spectrum fluoroquinolone antibiotic, was 
withdrawn because of adverse side effects in 
humans (ref. 8). Fluoroquinolones in general represent
some of the most commonly prescribed
antibiotics, but they carry the risk of severe 
side effects that can harm multiple organ systems
(ref. 9). Could abrogating unwanted targeting
of PANX1 channels by these drugs offer an 
opportunity to develop improved antibiotics? 
As yet, we do not know whether dysregulated 
production of apoptotic bodies has pathological
consequences, or whether another,
unidentified mechanism could cause adverse 
clinical effects when patients are treated with 
trovafloxacin — a mechanism that might 
be dependent or independent of PANX1- 
channel inhibition. What is crucial to deter- 
mine is the functional diversity of PANX1 
channels, and the pathological consequences 
of their blockade. 

As always, the unexpected is the most 
fascinating. There is no doubt that this work 
will stimulate researchers in the seemingly 
disparate areas of apoptosis and antibiotics to 
find out more about PANX1. The results may 
improve not only scientific understanding but 
also health care.


Christopher D. Gregory is at the 
MRC/University of Edinburgh Centre for 
Inflammation Research, Queen’s Medical 
Research Institute, University of Edinburgh, 
Edinburgh EH16 4TJ, UK.

e-mail: chris.gregory@ed.ac.uk 


1. Poon, I. K. H. et al. Nature 507, 329–334 (2014).
2. Elliott, M. R. et al. Nature 461, 282–286 (2009).
3. Chekeni, F. B. et al. Nature 467, 863–867 (2010).
4. Sandilos, J. K. & Bayliss, D. A. J. Physiol. (Lond.) 590, 6257–6266 (2012).
5. Wickman, G., Julian, L. & Olson, M. F. Cell Death Differ. 19, 735–742 (2012).
6. King, R. S. & Newmark, P. A. J. Cell Biol. 196, 553–562 (2012).
7. Hugel, B., Martinez, M. C., Kunzelmann, C. & Freyssinet, J. M. Physiology 20, 22–27 (2005).
8. Lucena, M. I. et al. Clin. Infect. Dis. 30, 400–401 (2000).
9. Liu, H. H. Drug Safety 33, 353–369 (2010).


This article was published online on 12 March 2014. 


Vessels of rejuvenation 


The identification of specialized endothelial-cell populations in the blood vessels 
of bones, and their signalling pathways, reveals how the vasculature contributes 
to bone formation. SEE ARTICLE P.323 & LETTER P.376 


FERDINAND LE NOBLE & JOS LE NOBLE 


The different structural and metabolic
requirements of organs and tissues
prompt the formation of specialized
blood vessels during development, which
deliver molecules that instruct tissues on their
fates. This process requires crosstalk between
organs and the vasculature: organs generate 
local growth factors that attract blood vessels 
toward metabolically active regions, and the 
vessels release molecules (known as angiocrine 
signals) that affect cell differentiation in the 
developing organs. It is becoming increasingly 
clear that these interactions are influenced 
by the endothelial cells that line the inner 
surface of blood vessels. In two papers in
this issue, Adams and colleagues demonstrate 
that the bone vasculature contains specialized 
endothelial cells with signalling properties that 
support bone maturation and regeneration. As 
well as providing insight into the tissue speci- 
ficity of angiocrine signalling, the findings 
have direct clinical relevance. 

In the first paper, Kusumbe et al. (page 323)
began by mapping gene expression, metabolic 
activity and cell-surface markers in the devel- 
oping postnatal bone vasculature of mice. The 
authors distinguished two new subpopulations 
of endothelial cell (EC) that they term type H 
and type L ECs. Interestingly, they found that 
osteoprogenitor cells, which differentiate into 
50 Years Ago


The Agar Precipitation Technique and 
Its Application as a Diagnostic and 
Analytical Method by F. Peetoom — 
Nomenclature is not discussed 
critically, and the style is diffuse and 
difficult to read. A translation which 
makes no better attempt at English 
style than “After the eleven each lasting 
one minute incubations the serum 
hardly contained ...” etc. (p. 88), the
strange punctuation, and the various 
misprints should not have appeared 
from a British publishing house. 
From Nature 21 March 1964 


100 Years Ago 


An Elizabethan Cookery-book

A Proper Newe Booke of Cokerye 
edited by Catherine E Frere — 

The original Cambridge edition 

is dateless ... the book was often 
reprinted before 1546, and was 

... the “Book of Cookery” of 1500 
... As we turn over the leaves of 
the “Proper Newe Booke,” with

its quaint recipes, couched in the 
“corrupted phonetic” of the golden 
age of English prose, we gather...“a 
little rushlight illumination on the 
culinary mysteries of the once busy 
kitchens, roofless and empty to-day, 
and on the hospitalities, feasting, 
and revels of the now silent dining 
halls of long ago” 


ALSO: 


Prof. Arthur Keith’s discourse on 

an anthropological study of some 
portraits of Shakespeare ... is 
printed in... the British Medical 
Journal ... Examination of the head 
of Shakespeare proves him to be a
representative example of the short- 
headed type, not of the early British 
breed, but of the round-headed race 
which entered Britain in the Bronze 
period about 2000 B.C. His brain
capacity was more than 1900 cm., as 
compared with 1477 in an average 
Englishman. 

From Nature 19 March 1914 
Comprehensive molecular characterization of urothelial bladder carcinoma


The Cancer Genome Atlas Research Network* 


Urothelial carcinoma of the bladder is a common malignancy that causes approximately 150,000 deaths per year world- 
wide. So far, no molecularly targeted agents have been approved for treatment of the disease. As part of The Cancer 
Genome Atlas project, we report here an integrated analysis of 131 urothelial carcinomas to provide a comprehensive land- 
scape of molecular alterations. There were statistically significant recurrent mutations in 32 genes, including multiple 
genes involved in cell-cycle regulation, chromatin regulation, and kinase signalling pathways, as well as 9 genes not previ- 
ously reported as significantly mutated in any cancer. RNA sequencing revealed four expression subtypes, two of which 
(papillary-like and basal/squamous-like) were also evident in microRNA sequencing and protein data. Whole-genome 
and RNA sequencing identified recurrent in-frame activating FGFR3-TACC3 fusions and expression or integration of 
several viruses (including HPV16) that are associated with gene inactivation. Our analyses identified potential therapeutic 
targets in 69% of the tumours, including 42% with targets in the phosphatidylinositol-3-OH kinase/AKT/mTOR pathway
and 45% with targets (including ERBB2) in the RTK/MAPK pathway. Chromatin regulatory genes were more frequently
mutated in urothelial carcinoma than in any other common cancer studied so far, indicating the future possibility of
targeted therapy for chromatin abnormalities.


Urothelial carcinoma of the bladder is a major cause of morbidity and 
mortality worldwide, causing an estimated 150,000 deaths per year.
Previous studies have identified multiple regions of somatic copy number 
alteration, including amplification of PPARG, E2F3, EGFR, CCND1 
and MDM2, as well as loss of CDKN2A and RB1 (refs 2, 3). Sequencing
of candidate pathways has identified recurrent mutations in TP53,
FGFR3, PIK3CA, TSC1, RB1 and HRAS (refs 2, 3). Whole-exome sequencing
of nine bladder cancers, followed by a replication analysis of 88
cancers, identified mutations at >10% frequency in several chromatin 
remodelling genes: KDM6A, CREBBP, EP300 and ARID1A (ref. 4).
Focused molecular analyses have delineated tumour subtypes and
identified kinase-activating FGFR3 gene fusions.

We report here a comprehensive, integrated study of 131 high-grade 
muscle-invasive urothelial bladder carcinomas as part of The Cancer 
Genome Atlas (TCGA) project. Included are data on DNA copy number,
somatic mutation, messenger RNA and microRNA (miRNA) expres- 
sion, protein and phosphorylated protein expression, DNA methyla- 
tion, transcript splice variation, gene fusion, viral integration, pathway 
perturbation, clinical correlates and histopathology to characterize the 
molecular landscape of urothelial carcinoma. This study identifies a 
number of mutations and regions of copy number variation that involve 
genes not previously reported as altered in a significant fraction of blad- 
der cancers. It also identifies potential therapeutic targets in most of the 
samples analysed. 


Demographic, clinical and pathological data 

Samples (from 19 tissue source sites) consisted of 131 chemotherapy- 
naive, muscle-invasive, high-grade urothelial tumours (T2-T4a, Nx, 
Mx), as well as peripheral blood (n = 118) and/or tumour-adjacent,
histologically normal-appearing bladder tissue (n = 23). Cases were
retained only if they met the following criteria: tumour nuclei constituted
≥60% of all nuclei; tumour necrosis was ≤20% of the specimen;
and variant histologies (squamous or small cell) were ≤50% (Supplementary
Information, section ‘Biospecimen collection and clinical data’).
Clinical and demographic characteristics are described in Supplementary 


Data 1.1. Five expert genitourinary pathologists re-reviewed all of the 
cases for multiple parameters, including the extent of variant histology 
(Supplementary Fig. 1.1a and Supplementary Information, section
‘Biospecimen collection and clinical data’). 


Somatic DNA alterations 


The tumours displayed a large number of DNA alterations, slightly 
fewer than in lung cancer and melanoma, but more than in other adult 
malignancies studied by TCGA (Fig. 1). On average, there were 302
exonic mutations, 204 segmental alterations in genomic copy number 
and 22 genomic rearrangements per sample. We analysed somatic copy 
number alterations (CNAs) using both SNP 6.0 arrays and low-pass 
whole-genome sequencing; the two were strongly concordant (Sup- 
plementary Methods 6.1 and Supplementary Fig. 6.1). There were 22 
significant arm-level copy number changes (Supplementary Data 6.1.1), 
and GISTIC (genomic identification of significant targets in cancer) 
(Supplementary Methods 6.2) identified 27 amplified and 30 deleted 
recurrent focal somatic CNAs (Supplementary Data 6.2.1 and 6.3.1). 
Focal amplifications involved genes previously reported to be altered in 
bladder cancer (Fig. 1c and Supplementary Fig. 6.2.1) and some not 
previously implicated. The latter included PVRL4, BCL2L1 and ZNF703. 
The most common recurrent focal deletion, seen in 47% of samples, 
contained CDKN2A (9p21.3) and correlated with reduced expression 
(Fig. 1 and Supplementary Fig. 2.7). Other focal deletions containing 
<10 genes appeared to target PDE4D, RB1, FHIT, CREBBP, IKZF2, 
FOXQ1, FAM190A (also called CCSER1), LRP1B and WWOX. 
Whole-exome sequencing of 130 tumours and matched normal sam- 
ples targeted 186,260 exons in 18,091 genes (mean coverage 100-fold, 
with 82% of target bases covered >30X). MuTect’? identified 39,312 
somatic mutations (including 38,012 point mutations and 1,138 indels 
(insertions or deletions)), yielding mean and median somatic mutation 
rates of 7.7 and 5.5 per megabase (Mb), respectively (Fig. 1a and Sup- 
plementary Table 2.1.1). Thirty-two genes showed statistically significant 
levels of recurrent somatic mutation (Fig. 1b and Supplementary 
Table 2.1.2) by analysis using MutSig 1.5 (refs 9, 11) (Supplementary 


*A list of authors and affiliations appears at the end of the paper. 
Figure 1 | The genomic landscape of bladder cancer. a, Mutation rate and 
type, histological subtype, smoking status, gender, tumour stage and cluster 
type. b, Genes with statistically significant levels of mutation (MutSig, false 
discovery rate <0.1) and mutation types. c, Deletions and amplifications for 
genomic regions with statistically significant focal copy number changes 
(GISTIC2.0). ‘Copy number’ refers to absolute copy number. Note that two 
amplification peaks (*) contain several genes, any of which could be the target, 


Methods 2.2). Three other genes identified by MutSig were not con- 
sidered further because of low or undetectable expression (Supplemen- 
tary Fig. 2.1.1). A similar analysis considering only mutations in the 
COSMIC database’ identified three more significantly mutated genes: 
ERBB2, ATM and CTNNB1 (Supplementary Table 2.1.3). We validated 
the mutation findings in three ways: targeted re-sequencing of all 
significantly mutated gene mutations, comparison with RNA-seq data 
for 123 samples and comparison with whole-genome sequence data for 
18 samples. Overall, the validation rate was >99% in selected muta- 
tions by a combination of the methods (Supplementary Methods 2.4). 
Nearly half (49%) of the samples had TP53 mutations (Fig. 1b), 
which were mutually exclusive in their relationship with amplification 
(9%) and overexpression (29%) of MDM2; hence, TP53 function was 
inactivated in 76% of samples. Most RB1 mutations were inactivating, 
were associated with significantly reduced mRNA level (Supplemen- 
tary Fig. 2.7) and were mutually exclusive with CDKN2A deletions 
(Supplementary Fig. 2.8 and Supplementary Table 2.8.1). FGFR3 muta- 
tions (12%) typically affected known kinase-activating sites. PIK3CA 
mutations were relatively common (20%), clustering in the helical 
domain near E545 (Supplementary Fig. 2.4). Most TSC1 mutations (8%) 
were truncating, and six were homozygous (allele fraction >0.5). 
Many of the 32 genes identified in Fig. 1b have not previously been 
reported as statistically significantly mutated in bladder cancer: MLL2 
as opposed to the single gene listed here. d, RNA expression level for selected 
genes, expressed as fold change from the median value for all samples. Tumour 
samples were grouped into three clusters (red, blue and green) using consensus 
NMF clustering (see the main text and Supplementary Fig. 2.1.2). Three 
samples with no copy number data and two samples with no mutations in the 
genes were not used in the clustering and are shown in grey. 


(also called KMT2D; 27%), CDKN1A* (14%), ERCC2* (12%), STAG2 
(11%), RXRA* (9%), ELF3* (8%), NFE2L2 (8%), KLF5* (8%), TXNIP 
(7%), FOXQ1* (5%), RHOB* (5%), FOXA1 (5%), PAIP1* (5%), BTG2* 
(5%), ZFP36L1 (5%), RHOA (4%) and CCND3 (4%). The nine genes 
marked with asterisks have not been reported as significantly mutated 
genes in any other TCGA cancer type or reported in another study as 
mutated at >3% frequency. CDKN1A (p21CIP1), a cyclin-dependent 
kinase inhibitor’’, had predominantly null or truncating mutations, 
indicating loss of function. Fifteen of sixteen mutations in ERCC2, a 
nucleotide excision repair gene” , were deleterious missense mutations, 
suggesting dominant-negative effects. ERCC2-mutant tumours also 
had significantly fewer C>G mutations than did ERCC2-wild-type 
tumours (Supplementary Figs 2.3.1 and 2.3.2), and they trended towards 
higher overall mutation rate (Supplementary Fig. 2.12). Seven of twelve 
mutations in RXRA (retinoid X nuclear receptor alpha)'* occurred at 
the same amino acid (five S427F; two S427Y) in the ligand-binding 
domain. Those seven tumours showed increased expression of genes 
involved in adipogenesis and lipid metabolism (Supplementary Fig. 2.6 
and Supplementary Data 2.6.1-2.6.3), suggesting that the mutations 
cause constitutive activation. 

Eleven tumours (8%) had deleterious missense mutations in the 
Neh2 domain of NFE2L2, a transcription factor that regulates the 
anti-oxidant program in response to oxidative stress’*. Those tumours 
showed markedly increased expression of genes involved in genotoxic 
metabolism and the reactive oxygen species (ROS) response (Supplemen- 
tary Figs 2.5.1-2.5.3 and Supplementary Data 2.5.2). Furthermore, 
nine samples had mutations in redox regulator TXNIP (ref. 16) (five 
of them inactivating) and were mutually exclusive of samples with 
NFE2L2 mutations, providing another mechanism for dysregulation 
of redox metabolism. Predominant inactivating mutations were seen in 
STAG2, an X-linked cohesin complex component required for separation 
of sister chromatids during cell division’” (Supplementary Fig. 2.4). 

Unsupervised clustering by non-negative matrix factorization of 
mutations and focal somatic CNAs in 125 samples identified three 
distinct groups (Fig. 1a and Supplementary Fig. 2.1.2). Group A (red), 
classified as ‘focally amplified’, is highly enriched in focal somatic 
CNAs in several genes, as well as mutations in MLL2 (Fig. 1 and Supplementary 
Tables 2.1.4 and 2.1.5). Group B (blue), classified as ‘papillary 
CDKN2A-deficient FGFR3 mutant’, is enriched in papillary 
histology. Nearly all group B samples show loss of CDKN2A, and most 
have one or more alterations in FGFR3. Group C (green), classified as 
‘TP53/cell-cycle-mutant’, shows TP53 mutations in nearly all samples, 
as well as enrichment with RB1 mutations and amplifications of E2F3 
and CCNE1 (Fig. 1 and Supplementary Table 2.1.4). These differences 
in pattern of mutation suggest the possibility of different oncogenic 
mechanisms. 

Seventy-two per cent of the cancers in this study were from current 
or past smokers, consistent with extensive epidemiological studies 
indicating an association between smoking and urothelial cancer risk. 
In contrast with lung cancer, however, there was no statistically signi- 
ficant association between smoking status and the mutational spectrum, 
frequency of mutation in any significantly mutated gene, occurrence of 
focal somatic CNAs or expression subtype (Supplementary Tables 2.9.1 
and 2.9.2). Never-smokers did have a slightly higher fraction of C>G 
mutations than did current/former smokers (28.5% versus 23.8%, 
P = 0.032; Supplementary Figs 2.3.2 and 2.3.3). Unsupervised clustering 
of promoter CpG island DNA methylation data revealed a major 
subgroup (34%) of tumours (CIMP) characterized by cancer-specific 
DNA hypermethylation (Supplementary Fig. 7.1). Multivariate regres- 
sion analysis with age, sex and tumour stage as covariates identified 
smoking pack-years as the only significant predictor of CIMP pheno- 
type, as has also been reported for colorectal cancer’. 

Fifty-one per cent of mutations overall were Tp*C→(T/G) (Supplementary 
Table 2.1.1), a class of mutation recently reported to be 
mediated by one of the DNA cytosine deaminases, APOBEC (refs 19, 20). 
APOBEC3B was expressed at high levels in all of the tumours, suggest- 
ing a major role for APOBEC-mediated mutagenesis in bladder carci- 
nogenesis (Supplementary Figs 12.1 and 12.2). 
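
As an illustration of how a TpC→(T/G) fraction of this kind could be tallied, the following minimal sketch classifies single-nucleotide variants by their flanking bases. The `Snv` record layout, the function names and the example variants are hypothetical and are not taken from the study's pipeline; the sketch only assumes that each variant is reported on the reference strand together with its immediate 5' and 3' neighbours.

```python
from typing import Iterable, NamedTuple


class Snv(NamedTuple):
    """One single-nucleotide variant on the reference strand (hypothetical record)."""
    upstream: str    # base immediately 5' of the mutated position
    ref: str         # reference base
    alt: str         # variant base
    downstream: str  # base immediately 3' of the mutated position


def is_tpc_to_t_or_g(snv: Snv) -> bool:
    """Return True for a C->T or C->G change at a TpC site on either strand."""
    # Forward strand: a T sits directly 5' of the mutated C.
    if snv.ref == "C" and snv.alt in ("T", "G") and snv.upstream == "T":
        return True
    # Reverse strand: the same event reads as G->A or G->C followed by an A.
    if snv.ref == "G" and snv.alt in ("A", "C") and snv.downstream == "A":
        return True
    return False


def tpc_fraction(snvs: Iterable[Snv]) -> float:
    """Fraction of variants that fall in the TpC->(T/G) class."""
    variants = list(snvs)
    if not variants:
        return 0.0
    return sum(is_tpc_to_t_or_g(v) for v in variants) / len(variants)


# Illustrative variants only; these are not data from the study.
example = [
    Snv("T", "C", "T", "A"),  # TpC -> T, forward strand
    Snv("A", "C", "T", "A"),  # C -> T, but not at a TpC site
    Snv("C", "G", "A", "A"),  # reverse-strand equivalent of TpC -> T
    Snv("C", "G", "C", "A"),  # reverse-strand equivalent of TpC -> G
    Snv("T", "C", "A", "G"),  # C -> A is outside the class
]
print(tpc_fraction(example))
```

Checking both strands matters because a variant caller reports every change on the reference strand, so a deamination event on the opposite strand appears as a G change followed by an A.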

Four genes involved in epigenetic regulation were significantly mutated 
genes: MLL2, ARID1A, KDM6A and EP300 (Fig. 1). Truncating muta- 
tions were significantly enriched in each of those genes (Supplemen- 
tary Fig. 2.2 and Supplementary Data 2.2.1-2). Three of the genes had 
previously been identified as mutated in urothelial cancers’, but muta- 
tion of MLL2, which encodes a histone H3 lysine 4 (H3K4) methyl- 
transferase, is a novel finding. Several other chromatin-regulating genes 
had mutation rates ≥10% but were not statistically significant by 
MutSig analysis: MLL3, MLL, CREBBP, CHD7 and SRCAP. Many other 
epigenetic regulators were mutated at lower frequency but were also 
enriched with truncating mutations, indicating functional significance 
(Supplementary Fig. 2.2 and Supplementary Data 2.2.1 and 2.2.2). 
Non-silent mutations in chromatin regulatory genes overall were sig- 
nificantly enriched in bladder cancer in comparison with the entire 
exome, in contrast with all other epithelial cancers studied so far in the 
TCGA project (Supplementary Table 2.10). Mutations in MLL2 and 
KDM6A (the latter encoding a histone H3 lysine 27 (H3K27) demethylase) 
were mutually exclusive (Supplementary Fig. 2.8 and Supplementary 
Table 2.8.1), suggesting that mutations in the two genes have 
redundant downstream effects on carcinogenesis or that the combined 
loss is synthetically lethal. 
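
One common way to ask whether two genes are mutated together less often than chance would predict is a one-sided Fisher's exact test on the 2 x 2 table of sample counts. The sketch below is an assumption about how such a check could be written, not a description of the method used for Supplementary Fig. 2.8; the function name and the counts in the usage lines are hypothetical.

```python
from math import comb


def fisher_exclusivity_p(both: int, only_a: int, only_b: int, neither: int) -> float:
    """One-sided Fisher's exact P value for seeing `both` or fewer co-mutated samples.

    Small values suggest that alterations in gene A and gene B co-occur
    less often than expected if they were independent.
    """
    n = both + only_a + only_b + neither
    a_total = both + only_a  # samples mutated in gene A
    b_total = both + only_b  # samples mutated in gene B
    denominator = comb(n, b_total)
    # Smallest overlap that is possible given the two marginal totals.
    smallest = max(0, a_total + b_total - n)
    p_value = 0.0
    for k in range(smallest, both + 1):
        # Hypergeometric probability of exactly k co-mutated samples.
        p_value += comb(a_total, k) * comb(n - a_total, b_total - k) / denominator
    return p_value


# Hypothetical counts for illustration only (not taken from the study).
print(fisher_exclusivity_p(both=0, only_a=3, only_b=3, neither=0))
```

A test of this form says nothing about why two alterations avoid each other; redundancy and synthetic lethality, the two explanations raised above, would produce the same table.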




Chromosomal rearrangements and viral integration 


To identify structural variations and pathogen sequences, we used low-pass, 
paired-end, whole-genome sequencing (WGS; 6-8× coverage) of 
114 tumours and RNA sequencing of all tumours. We detected 2,529 
structural aberrations, including 1,153 that involve gene-gene fusions. 
Among the translocations, 379 were inter-chromosomal, 237 were 
intra-chromosomal, 274 were the result of inversions and 263 resulted 
from deletions (Supplementary Table 3.1). We found several recurrent 
translocations of probable pathogenic significance, including an intra- 
chromosomal translocation on chromosome 4 involving FGFR3 and 
TACC3 (n = 3). The breakpoints were in intron 16 (two cases) or exon 
17 (one case) of FGFR3 and intron 10 of TACC3 (confirmed by DNA 
sequencing and RNA-seq). All three lead to fusion mRNA products for 
which the predicted proteins include the amino-terminal 758 amino 
acids of FGFR3 fused with the carboxy-terminal 191 amino acids of 
TACC3 (Fig. 2a). On the basis of the structure of the FGFR3-TACC3 
fusion protein, we predict that it can auto-dimerize, leading to con- 
stitutive activation of the kinase domain of FGFR3. FGFR3-TACC3 
fusion, which was recently described in both glioblastoma”! and blad- 
der cancer’®, represents a promising therapeutic target. The ERBB2 gene 
was also involved in translocations in four tumours, all with different 
fusion partners and all confirmed by DNA sequencing, RNA-seq or 
both. In one case, exons 4 to 29 of ERBB2 were fused to the promoter 
plus exon 1 of DIP2B, and the fusion product was amplified (Fig. 2b). 
Two other fusion products resulted in novel mRNA products, the bio- 
logical significance of which is not known. 

We identified viral DNAs in 7 of 122 tumours (6%), and viral tran- 
scripts in 5 of 122 (4%). Three tumours expressed cytomegalovirus (CMV) 
transcripts (encoding RL5A, RNA2.7, RL9A, RNA1.2, UL5 and UL22A), 
one expressed BK polyoma virus and one expressed human papilloma 
virus 16 (HPV16). HPV16 and human herpesvirus 6B DNA were each 
identified in one other sample but without expression. None of the 
tumours expressing CMV showed evidence of CMV integration into 
the host genome, suggesting the presence of a stable episome. In the 
BK-positive tumour, two BK genes were integrated into GRB14, a sig- 
nalling adaptor protein for receptor tyrosine kinases. In the HPV-16- 
expressing case, the virus integrated into BCL2L1, an apoptosis-regulating 
gene (Fig. 2c). In that tumour, BCL2L1 was amplified (~6×) and 
Figure 2 | Structural rearrangements and viral integration. a, FGFR3-TACC3 
fusion in sample TCGA-CF-A3MH showing the breakpoints in the 
two genes, the breakpoint junction sequences and the predicted fusion protein. 
b, Rearrangement involving DIP2B and ERBB2 in TCGA-DK-A2I6. The 
ERBB2 gene has swapped its promoter with that of DIP2B, resulting in 
overexpression of ERBB2. c, Insertion of human papilloma virus 16 (HPV 16) 
into the BCL2L1 gene on chromosome 20 in TCGA-GC-A3I6. The region of 
BCL2L1 into which the virus has integrated and the integration junction 
sequence are shown. 
overexpressed (~10× median; >2× any of the other samples). Overall, 
these findings indicate that viral infection may have a role in the 
development of a small percentage of urothelial carcinomas. 


mRNA, miRNA and protein expression 
Analysis of RNA-seq data from 129 tumours identified four clusters 
(clusters I-IV) (Fig. 3 and Supplementary Fig. 4.1). Cluster I (‘papillary- 
like’) is enriched in tumours with papillary morphology (P = 0.0002), 
FGFR3 mutations (P = 0.0007, q = 0.02), FGFR3 copy number gain 
(P = 0.04, q = 0.1) and elevated FGFR3 expression (P < 0.0001) (Fig. 3a). 
It includes all three samples with FGFR3-TACC3 fusions. Cluster I 
samples also show significantly lower expression of miR-99a and miR-100, 
miRNAs that downregulate FGFR3 expression (P = 0.0002, Figs 3a 
and Supplementary Fig. 5.3)**. Cluster I samples also show lower 
expression of miR-145 and miR-125b, which have been reported as 
frequently downregulated in bladder cancer**. Tumours with FGFR3 
alterations, and perhaps other tumours that share the cluster I expression 
profile, may respond to inhibitors of FGFR or its downstream targets. 

Reverse-phase protein array (RPPA) data indicate that clusters I 
and II express high HER2 (ERBB2) levels and an elevated oestrogen 
receptor beta (ESR2) signalling signature, indicating potential targets 
for hormone therapies such as tamoxifen or raloxifene (Fig. 3d). In 
fact, HER2 protein levels in a subset of the tumours are comparable to 
those found in TCGA HER2-positive breast cancers”’. 

For comparison, we asked whether any of the four clusters show 
gene signatures similar to those identified in any other tumour type(s) 
among the first 11 analysed by TCGA. We found that the signature of 
Figure 3 | Expression characteristics of bladder cancer. Integrated analysis 
of mRNA, miRNA and protein data led to identification of distinct subsets of 
urothelial carcinoma. Data for mRNA, miRNA and protein were z-normalized, 
and samples were organized in the horizontal direction by mRNA clustering. 
a, Papillary histology, FGFR3 alterations, FGFR3 expression and reduced 
FGFR3-related miRNA expression are enriched in cluster I. b, Expression of 
epithelial lineage genes and stem/progenitor cytokeratins are generally high in 
cluster III, some of which show variant squamous histology. c, Luminal breast 
and urothelial differentiation factors are enriched in clusters I and II. d, ERBB2 
mutation and oestrogen receptor beta (ESR2) expression are enriched in 
clusters I and II. 
bladder cancer cluster III (‘basal/squamous-like’) is similar to that of 
basal-like breast cancers, as well as squamous cell cancers of the head 
and neck and lung (Supplementary Fig. 4.2)°*°. All four of those cancer 
types express characteristic epithelial lineage genes, including KRT14, 
KRT5, KRT6A and EGFR. Basal-like subtype** and squamous cell 
subtype” of urothelial carcinoma have been independently reported. 
Many of the samples in bladder cluster III express cytokeratins (that is, 
KRT14 and KRT5) that were recently reported to mark stem/progenitor 
cells*®. Some of those samples also show a level of variant squamous 
histology (Fig. 3b). Bladder clusters I and II show features similar to 
those of luminal A breast cancer, with high mRNA and protein expres- 
sion of luminal breast differentiation markers, including GATA3 and 
FOXA1 (Fig. 3c). Markers of urothelial differentiation such as the 
uroplakins (for example, UPK3A) are also highly expressed in clusters 
I and II, as are the epithelial marker E-cadherin and members of 
the miR-200 family of miRNAs (which target multiple regulators of 
epithelial-mesenchymal transition)** (Fig. 3c). Taken together, these 
observations indicate that, despite their diverse tissue origins, some 
bladder, breast, head and neck and lung cancers share common path- 
ways of tumour development. 

To determine whether the expression-based clusters could be seen 
in other data sets, we used the muscle-invasive bladder cancer samples 
from ref. 27, hierarchically clustering them with the genes used in our 
analysis. From the sample dendrogram, we identified four groups 
(Supplementary Fig. 4.3a). The four groups identified in the data set 
of ref. 27 correlated well with the four clusters identified in our TCGA 
data (Supplementary Fig. 4.3b). 

When we analysed the RNA-seq data for transcript splice variation 
using SpliceSeq” (Supplementary Information, section 11), one find- 
ing of interest was an average of 3% PKM1 and 97% PKM2 transcripts 
in the tumour samples. The PKM2 isoform of pyruvate kinase is the 
principal driver of a shift to aerobic glycolysis in tumours (the Warburg 
effect)*°. Therefore, urothelial bladder cancers (and other cancer types) 
may prove sensitive to inhibition of glycolysis or related metabolic 
pathways. 


Pathway analysis and therapeutic targeting 


Integrated analysis of the mutation and copy-number data revealed three 
main pathways as frequently dysregulated in bladder cancer: cell cycle 
regulation (altered in 93% of cases); kinase and phosphatidylinositol- 
3-OH kinase (PI(3)K) signalling (72%); and chromatin remodelling, 
including mutations/somatic CNAs in histone-modifying genes (89%) 
and components of the SWI/SNF nucleosome remodelling complex 
(64%) (Fig. 4a). To complement these results for well-defined path- 
ways, we applied network analysis methods to examine other possible 
interactions between genes and pathways (Fig. 4b). In particular, we 
used the TieDIE algorithm to search for causal regulatory interactions 
within the PARADIGM network, which connects mutated genes to 
active transcriptional hubs****. The analysis identified a sub-network 
linking mutated histone-modifying genes to a large array of activated 
transcription factors, indicating potential far-reaching effects of his- 
tone modification on other pathways (Supplementary Fig. 8.2.1) con- 
verging on MYC/MAX regulation. Both MYC and MAX showed 
similar levels of pathway activity, independent of mutations in chro- 
matin genes, suggesting that mutations in histone-modifying genes 
provide just one mechanism for disruption of the MYC/MAX hub. 
By contrast, tumours with chromatin-related mutations showed dif- 
ferential activity of transcription factors FOXA2 and SP1, implicating 
de-differentiation processes as a result of the mutations. Our network 
analysis also identified HSP90AA1 as a critical signalling hub, indicating 
that inhibitors of HSP90 may have therapeutic value in urothelial 
carcinoma. Although the linkages between mutations and transcrip- 
tional changes were statistically significant in terms of their proximity 
in the network (as determined by permutation tests; see Supplemen- 
tary Fig. 8.2), further studies will be needed to assess the biological 
relevance of the findings. 
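
Pathway-level percentages such as those quoted above are typically obtained by counting the samples that carry at least one alteration in any member gene. The sketch below shows that counting rule under stated assumptions: the function name, the sample identifiers and the three-gene list are hypothetical, and the actual gene sets are those defined in Fig. 4a.

```python
from typing import Dict, Iterable, Set


def pathway_alteration_frequency(
    altered_genes_by_sample: Dict[str, Set[str]],
    pathway_genes: Iterable[str],
) -> float:
    """Fraction of samples with at least one altered gene in the pathway."""
    pathway = set(pathway_genes)
    if not altered_genes_by_sample:
        return 0.0
    # A sample counts once, however many pathway genes it has altered.
    hits = sum(1 for genes in altered_genes_by_sample.values() if pathway & genes)
    return hits / len(altered_genes_by_sample)


# Hypothetical samples and an illustrative subset of cell-cycle genes.
samples = {
    "S1": {"TP53"},
    "S2": {"RB1", "PIK3CA"},
    "S3": {"PIK3CA"},
    "S4": set(),
}
cell_cycle_subset = ["TP53", "RB1", "CDKN2A"]
print(pathway_alteration_frequency(samples, cell_cycle_subset))
```

Because each sample is counted once, frequencies for overlapping pathways do not add up to 100%, which is why the cell-cycle, kinase-signalling and chromatin-remodelling percentages above can each be large.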
Figure 4 | Altered pathways and networks 


mutations and copy number alterations (CNA) in components of the p53/Rb 
pathway, RTK/RAS/PI(3)K pathway, histone modification system and SWI/ 
SNF complex. Red, activating genetic alterations; blue, inactivating genetic 
alterations. Percentages shown denote activation or inactivation of at least one 
allele. b, The network connecting mutated histone-modifying genes to 
transcription factors with differential activity (methodology and larger 
implicated network in Supplementary Fig. 8.2.1). Each gene is depicted as a 
multi-ring circle with various levels of data, plotted such that each ‘spoke’ in the 


Integrated analysis also identified mutations, copy number alterations 
or RNA expression changes affecting the PI(3)K/AKT/mTOR 
pathway in 42% of the tumours (Fig. 5a). Included were 
point mutations in PIK3CA (17%; potentially responsive to PI(3)K 
inhibitors), mutation or deletion of TSC1 or TSC2 (9%; potentially 
responsive to mTOR inhibitors) and overexpression of AKT3 (10%; 
potentially responsive to AKT inhibitors). We also observed mutations, 
genomic amplifications or gene fusions that affect the RTK/RAS 
pathway in 44% of the tumours (Fig. 5b, c). Included were events that 
can activate FGFR3 (17%; potentially responsive to FGFR inhibitors 
or antibodies), amplification of EGFR (9%; potentially responsive to 
EGFR antibodies or inhibitors), mutations of ERBB3 (6%; potentially 
Figure 5 | Potential targets in bladder cancer. a, Alterations in the PI(3)K/ 
AKT/mTOR pathway are mutually exclusive. Tumour samples are shown in 
columns; genes in rows. Only samples with at least one alteration are shown. 
AKT3 shows elevated expression in 10% of samples, independent of copy 
number (right panel). Hetloss, heterozygous loss. b, Receptor tyrosine kinases 
are altered, by any of several different mechanisms (amplification, mutation or 
fusion), in 45% of samples. Only mutations that are recurrent in this data set or 
previously reported in COSMIC are shown. c, Recurrent mutations in ERBB2 
and ERBB3. The mutations shown in black are either recurrent in the TCGA 
data set or reported in COSMIC. Green, receptor L domain; red, furin-like 
cysteine-rich region; blue, growth factor receptor domain IV; yellow, tyrosine 
kinase domain. d, ERBB2 amplifications and recurrent mutations in other 
cancers profiled by TCGA. Missense mutations were counted in the following 
positions: G309, S310, L313, R678, T733, L755, V777, D769, V842, T862, R896 
and M916I. In-frame insertions were counted between amino acids 774 and 
776. Only tumour types with an alteration frequency ≥2% are shown. 
sensitive to ERBB kinase inhibitors) and mutation or amplification of 
ERBB2 (9%; potentially sensitive to ERBB2 kinase inhibitors or antibodies). 
ERBB3 mutations in bladder cancer have been noted previously*, but 
statistically significant mutation of ERBB2 in bladder cancer has not 
been reported. Both genes are potential therapeutic targets in other 
diseases**-*°. Notably, ERBB2 alterations were approximately as fre- 
quent in this study as in TCGA breast cancers, but with fewer ampli- 
fications and more mutations (Fig. 5d)**. 


Discussion 


This integrated study of 131 invasive urothelial bladder carcinomas 
provides numerous novel insights into disease biology and delineates 
multiple potential opportunities for therapeutic intervention. Treatment 
for muscle-invasive bladder cancer has not advanced beyond cisplatin- 
based combination chemotherapy and surgery in the past 30 years”®, 
and no new drugs for the disease have been approved in that time. 
Median survival for patients with recurrent or metastatic bladder can- 
cer remains 14-15 months with cisplatin-based chemotherapy, and 
there is no widely recognized second-line therapy’. With the excep- 
tion of a single case report, there is also no known benefit from treat- 
ment with newer, targeted agents**. Several of the genomic alterations 
identified in this study, particularly those involving the PI(3)K/AKT/ 
mTOR, CDKN2A/CDK4/CCND1 and RTK/RAS pathways, including 
ERBB2 (Her-2), ERBB3 and FGFR3, are amenable in principle to 
therapeutic targeting. Clinical trials based on patients with relevant 
druggable genomic alterations are warranted. 

FGFR3 mutation is a common feature of low-grade non-invasive 
papillary urothelial bladder cancer, but it occurs at a much lower fre- 
quency in high-grade invasive bladder cancer. The cluster analysis in 
Fig. 3 highlights multiple mechanisms of FGFR3 activation, and its 
strong association with papillary morphology. The data presented here 
suggest a subset of muscle-invasive cancers that can potentially be 
targeted through FGFR3. Similarly, ERBB2 amplification may be targe- 
table by strategies used in breast cancer, by small-molecule tyrosine kinase 
inhibitors or by novel immunotherapeutic approaches (NCT01353222)™. 
The data here provide further support for several on-going ERBB2- 
targeted trials in bladder cancer and further define the subpopulation 
of cancers suited to that approach. Finally, cluster III of the integrated 
expression profiling analysis reveals the existence of a urothelial car- 
cinoma subtype with cancer stem-cell expression features (including 
KRT14 and KRT5), perhaps providing another avenue for therapeutic 
targeting. 

The alterations identified in epigenetic pathways also suggest new 
possibilities for bladder cancer treatment. Ninety-nine (76%) of the 
tumours analysed here had an inactivating mutation in one or more of 
the chromatin regulatory genes, and 53 (41%) had at least two such 
mutations. Overall, the bladder cancers showed a mutational spectrum 
highly enriched with mutations in chromatin regulatory genes (Sup- 
plementary Table 2.10). Furthermore, integrated network analyses 
revealed a profound impact of those mutations on the activity levels 
of various transcription factors and pathways implicated in cancer. 
Drugs that target chromatin modifications—for example, recently 
developed agents that bind acetyl-lysine binding motifs (bromodo- 
mains)—might prove useful for treatment of the subset of bladder 
tumours that exhibit abnormalities in chromatin-modifying enzymes”. 
Our findings overall indicate bladder cancer as a prime candidate for 
exploration of that approach to therapy. 


METHODS SUMMARY 


Tumour and normal samples were obtained with institutional-review-board- 
approved consent and processed using a modified AllPrep kit (Qiagen) to obtain 
purified DNA and RNA. Quality-control analyses revealed only modest batch 
effects (Supplementary Information, section ‘Batch effects’). The tumours were 
profiled using Affymetrix SNP 6.0 microarrays for somatic CNAs, low-pass WGS 
(HiSeq) for somatic CNAs and translocations, RNA-seq (HiSeq) for mRNA and 
miRNA expression, Illumina Infinium (HumanMethylation450) arrays for DNA 
methylation, HiSeq for exome sequencing and RPPA for protein expression and 
phosphorylation. Statistical analysis and biological interpretation of the data were 
spearheaded by the TCGA genome data analysis centres. Sequence files are in 
CGHub (https://cghub.ucsc.edu/). All other molecular, clinical and pathological data 
are available through the TCGA Data Portal (https://tcga-data.nci.nih.gov/tcga/). The 
data can be explored through a compendium of next-generation clustered heat 
maps (http://bioinformatics.mdanderson.org/TCGA/NGCHMPortal/), the cBio 
Cancer Genomics Portal (http://cbioportal.org), TieDIE (http://sysbiowiki.soe.ucsc.edu/tiedie), 
SpliceSeq (http://bioinformatics.mdanderson.org/main/SpliceSeq:Overview), 
MBatch batch effects assessor (http://bioinformatics.mdanderson.org/tcgambatch/) 
and Regulome Explorer (http://explorer.cancerregulome.org/). Also 
see Supplementary Information. 
Coupling of angiogenesis and osteogenesis 
by a specific vessel subtype in bone 


Anjali P. Kusumbe'*, Saravana K. Ramasamy'* & Ralf H. Adams!* 


The mammalian skeletal system harbours a hierarchical system of mesenchymal stem cells, osteoprogenitors and 
osteoblasts sustaining lifelong bone formation. Osteogenesis is indispensable for the homeostatic renewal of bone as 
well as regenerative fracture healing, but these processes frequently decline in ageing organisms, leading to loss of bone 
mass and increased fracture incidence. Evidence indicates that the growth of blood vessels in bone and osteogenesis are 
coupled, but relatively little is known about the underlying cellular and molecular mechanisms. Here we identify a 
new capillary subtype in the murine skeletal system with distinct morphological, molecular and functional properties. 
These vessels are found in specific locations, mediate growth of the bone vasculature, generate distinct metabolic and 
molecular microenvironments, maintain perivascular osteoprogenitors and couple angiogenesis to osteogenesis. The 
abundance of these vessels and associated osteoprogenitors was strongly reduced in bone from aged animals, and 
pharmacological reversal of this decline allowed the restoration of bone mass. 


Blood vessels mediate the transport of circulating cells, oxygen, nutri- 
ents and waste products, but also provide so-called angiocrine signals 
controlling organ growth and homeostasis’~. In addition to well-known 
differences between arteries, capillaries and veins, endothelial cells (ECs) 
are highly heterogenous and acquire specialized functional properties 
in local microenvironments”®. The nature of these specialization pro- 
cesses and the underlying regulation remain largely unknown. 

In the mammalian skeletal system, growth of the vascular network 
is regulated by signals provided by chondrocytes and other bone cells, 
among which the vascular endothelial growth factor (VEGF) is best 
understood’*. Conversely, blood vessels are thought to influence the 
osteogenic generation of new bone”””. In addition to earlier studies high- 
lighting the close proximity of vascular and osteoblastic cells, potential 
roles of angiogenic blood vessel growth in fracture healing have been 
proposed'"”. It also has been suggested that alterations in the skeletal 
microvasculature might be linked to compromised haematopoiesis and 
osteogenesis in human subjects with primary osteoporosis or during old 
age’*"*, However, direct evidence for such disease-causing or age-related 
alterations is lacking and our understanding of the normal organization, 
functional specialization and precise function of the skeletal vasculature 
is incomplete. 


Vessel architecture and bone oxygenation 


Previous work has shown that molecular and structural differences distin- 
guish arteries and distal arterioles in bone from sinusoidal capillaries". 
Sinusoidal capillaries form highly branched and irregular networks, and 
are primarily found between haematopoietic cells within the marrow 
cavity. Arteries contain comparably few side branches and, as earlier 
publications’””° have shown, preferentially enter long bone through the 
femoral or tibial head. The precise overall organization of the skeletal 
vasculature has remained poorly understood because of technical dif- 
ficulties associated with the processing of bone combined with the loss 
of crucial three-dimensional information in thin tissue sections. Revised 
immunohistochemistry protocols (see Methods) have now allowed us 
to image the bone vasculature at high resolution. To characterize the 
organization of arteries, α-smooth muscle actin (α-SMA) staining on 
thick tibia sections (300 µm) was performed. As proposed previously, 
straight and smooth muscle cell-covered arteries entered through the 
bone cortex (Fig. 1a). The central diaphysis contained few, largely 
unbranched arteries (diameter 16-27 µm). In contrast, substantial branching 
was seen in the metaphysis when arteries (diameter 5-15 µm) approached 
the growth plate (Fig. 1a), a structure that in small rodents, unlike in 
humans, persists into adulthood. Some distal arterioles terminated at 
capillaries in the endosteum (the connective tissue lining the inner surface 
of compact bone), whereas most termination points were found in the 
metaphysis near the growth plate (Fig. 1b, cand Extended Data Fig. 1a). 
Veins were located in the central diaphysis, connected to metaphyseal 
capillaries and exited through the cortical bone collar (Extended Data 
Fig. 2a). Thus, arterial oxygen and nutrient-rich blood is preferentially 
fed into capillaries near the growth plate, which might affect the local 
metabolic status. Analysis of tibia sections by pimonidazole staining indi- 
cated low oxygenation of the diaphysis whereas the metaphysis was not 
hypoxic (Extended Data Fig. 2b-e). Likewise, hypoxia-inducible factor 
1 & (HIF1-«) immunostaining abundantly labelled cell nuclei through- 
out the diaphysis and in secondary ossification centres but not in meta- 
physis (Extended Data Fig. 3a). The plasma membrane lactate transporter 
MCTA4 (also known as Slcl6a3), a hypoxia-regulated protein”’, and glu- 
cose transporter-1 (Glut1), which is induced by HIF signalling and low 
glucose”, were abundant throughout the diaphysis but not in the meta- 
physis (Extended Data Fig. 3b-d). Conversely, phosphorylated mitogen- 
activated protein kinase (phospho-Erk1/2), a marker of cell growth and 
high metabolic activity’, was strongly enriched in metaphyseal cells 
(Extended Data Fig. 3e). These data indicate that the spatial distribution 
of arteries and arteriole-capillary connections determine regional dif- 
ferences in oxygenation and metabolic activity. 


Identification of a distinct EC subpopulation in bone 

In addition to revised immunofluorescence protocols, we visualized bone
vessels by endothelial-cell-specific expression of green fluorescent protein
(GFP) in tamoxifen-treated Cdh5(PAC)-CreERT2, Rosa26-mT/mG double
transgenic mice. Imaging of the bone microvasculature with both
approaches uncovered structurally distinct capillary subsets. Consistent
with earlier dye injection and corrosion cast experiments, endothelial
tubes in the metaphysis resembled straight columns that were
interconnected by distal vessel loops or arches. In contrast, diaphyseal
capillaries displayed the highly branched pattern characteristic of the
sinusoidal vasculature of bone marrow (Fig. 1d, e and Extended Data
Fig. 4a, b). At the interface between metaphysis and diaphysis, the two
vessel types were connected, confirming that they were part of one
continuous vascular bed (Fig. 1e). Cdh5(PAC)-CreERT2 and Rosa26-mT/mG
double transgenics also validated the endothelial cell specificity
of Cre and absence of GFP in chondrocytes, osteoprogenitors,
osteocytes or haematopoietic cells (Extended Data Fig. 4c-e).

Figure 1 | Identification of bone vessel subtypes. a, Confocal images showing
α-SMA+ arteries (green) in thick sections (300 μm) of juvenile 4-week-old
tibia. Nuclei, DAPI (4′,6-diamidino-2-phenylindole, blue). Arteries enter
through the cortex (red arrowhead), terminate in endosteum (yellow), and
branch in the metaphysis (mp) or diaphysis (dp) (white). Dashed lines mark
growth plate (gp) or endosteum (es). b, α-SMA+ (white) arteries and
α-SMA− CD31+ (green) Emcn− (red) arterioles terminate in CD31+ Emcn+
capillaries within metaphysis and endosteum (es). c, Quantitation of α-SMA+
(top) and CD31+ Emcn− (bottom) arterial branch points in 4-week-old tibia.
Data represent mean ± s.e.m. (n = 6 mice for quantification of α-SMA+
arterial branch points from six independent experiments and n = 4 for
quantification of CD31+ Emcn− arterial branch points; both from four
independent experiments). P values, two-tailed unpaired t-tests.
d, Representative tile scans of GFP+ (green) endothelium in juvenile
Cdh5(PAC)-CreERT2, Rosa26-mT/mG double heterozygous mouse tibia.
Yellow dashed line marks growth plate (gp). Note distinct organization and
interconnections (arrowheads) of microvessels in metaphysis (mp) and
diaphysis (dp). e, Maximum intensity projections of endothelial cell columns in
metaphysis (mp, arrowheads mark distal protrusions), which are connected
(arrowheads in central panel) to highly branched sinusoids in diaphysis (dp).
f, Confocal tile scan of juvenile 3-week-old tibia showing CD31+ (green) and
Emcn+ (red) endothelial cells in metaphysis (mp) and endosteum
(arrowheads). g, Representative flow cytometry dot plots showing
CD31(hi)Emcn(hi) endothelial cells in total and lineage (Lin) depleted bone
marrow cells (BMCs). h, Relative abundance of endothelial cell (EC) subtypes
in 4-week-old long bone. CD31(hi)Emcn(hi) cells represent 1.77 ± 0.01%
(mean ± s.d., n = 7 mice from three independent experiments) of total
endothelial cells.

1Max Planck Institute for Molecular Biomedicine, Department of Tissue Morphogenesis, D-48149 Münster, Germany. *University of Münster, Faculty of Medicine, D-48149 Münster, Germany.

The different vessel types were distinguishable by immunostaining
with specific cell surface markers. Columnar tubes and arches in the
metaphysis and endosteal endothelial cells were strongly positive for
CD31 (also known as PECAM1) and Endomucin (Emcn), whereas
sinusoidal vessels in the diaphysis displayed only weak CD31 staining
and slightly lower Emcn expression (Fig. 1f and Extended Data Fig. 4f, g).
A distinct CD31(hi)Emcn(hi) endothelial subset could also be identified
and separated from CD31(lo)Emcn(lo) cells in single-cell suspensions of
long bones (see Methods) (Fig. 1g). Analysis of arteries in relation to
capillary subsets indicated that distal CD31+ Emcn− arterioles terminated
within the CD31(hi)Emcn(hi) endothelium (Fig. 1b). Quantitative
analysis by flow cytometry showed that CD31(hi)Emcn(hi) cells represented
only a small fraction of total endothelial cells (Fig. 1h).
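
The two-marker classification described above can be illustrated with a minimal sketch. It assumes that events have already been gated on endothelial cells and that a single intensity cut-off per marker separates "hi" from "lo"; the cut-off values, the `Event` type and the example intensities are hypothetical and are not taken from the study.

```python
from dataclasses import dataclass


@dataclass
class Event:
    cd31: float  # CD31 fluorescence intensity (arbitrary units)
    emcn: float  # Endomucin fluorescence intensity (arbitrary units)


# Hypothetical gate positions; real gates are set per experiment from controls.
CD31_HI = 1000.0
EMCN_HI = 1000.0


def classify(event, cd31_hi=CD31_HI, emcn_hi=EMCN_HI):
    """Return 'H' for CD31(hi)Emcn(hi), 'L' for CD31(lo)Emcn(lo), else 'other'."""
    hi_cd31 = event.cd31 >= cd31_hi
    hi_emcn = event.emcn >= emcn_hi
    if hi_cd31 and hi_emcn:
        return "H"
    if not hi_cd31 and not hi_emcn:
        return "L"
    return "other"


def type_h_percentage(events):
    """Percentage of endothelial events that fall in the type H gate."""
    if not events:
        raise ValueError("no events to classify")
    n_h = sum(1 for e in events if classify(e) == "H")
    return 100.0 * n_h / len(events)


# Made-up intensities: one event in the hi/hi gate, three in the lo/lo gate.
events = [Event(1500, 2000), Event(200, 300), Event(150, 400), Event(120, 250)]
print(type_h_percentage(events))
```

A rectangular gate is the simplest choice; in practice gates are drawn on the measured distributions, so the thresholds here only stand in for that step.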

These observations established the existence of spatial and phenotypic
heterogeneity in the endothelium of long bone, which also applied
to other skeletal elements in mice such as vertebra, sternum and
calvarium (Extended Data Fig. 5a-d). Analysis of endothelial cells from
other organs showed that they all, with the exception of liver, lacked a
CD31(hi)Emcn(hi) subset (Extended Data Fig. 5e). On the basis of these
findings, we propose the following terminology for bone microvessels:
type H for the small CD31(hi)Emcn(hi) subset and type L for the
CD31(lo)Emcn(lo) sinusoidal vessels.


Type H endothelium and osteoprogenitor cells 


Immunostaining showed that Osterix+ osteoprogenitors, which will give
rise to osteoblasts and osteocytes, were selectively positioned around
type H but not type L endothelium (Fig. 2a). Likewise, collagen type
1α+ osteoblastic cells and Runx2+ early osteoprogenitors were abundant
around the CD31+ columns in the metaphysis and CD31+ endosteal
vessels, but essentially absent from the vicinity of diaphyseal type L
vessels (Fig. 2b, c). Despite the low frequency (~1.77%) of type H
endothelial cells in the bone endothelial cell fraction and ~0.015% in
total bone marrow (Fig. 1g), the majority of Runx2+ (82.63 ± 1.8%),
collagen 1α+ (74 ± 3.3%) and Osterix+ cells (70 ± 1.9%) were located
directly adjacent to CD31(hi)Emcn(hi) vessels (Fig. 2d). Mesenchymal cells
expressing platelet-derived growth factor receptor β (PDGFRβ) were
also preferentially associated with type H vessels in metaphysis and
endosteum (Fig. 2e).

To understand this distribution pattern of osteoblastic cells, the
expression of mRNAs for secreted growth factors with known roles
in osteoprogenitor survival and proliferation was analysed in freshly
purified endothelial cells from long bone. Pdgfa, Pdgfb, Tgfb1, Tgfb3
and Fgf1 transcripts were expressed at significantly higher levels in type H
relative to type L endothelial cells (Fig. 3a). Likewise, quantitative PCR
(qPCR) analysis of flushed tibiae retaining the endosteal type H endothelial
cells revealed strong enrichment of the same transcripts in comparison
to extracted marrow containing type L endothelial cells (Extended
Data Fig. 6a, b). Accordingly, the two bone capillary endothelial cell
subsets have specific expression profiles, suggesting specialized functional
properties.
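
For readers unfamiliar with how relative transcript levels of this kind are commonly derived, the sketch below uses the comparative Ct (2^-ΔΔCt) calculation. That this exact calculation was used here is an assumption: the text states only that expression was normalized to Actb and reported for type H relative to type L cells. The Ct values are hypothetical, and the formula assumes roughly 100% amplification efficiency.

```python
def relative_expression(ct_target_sample, ct_ref_sample,
                        ct_target_control, ct_ref_control):
    """Fold change of a target transcript by the comparative Ct method.

    'sample' would be type H cells and 'control' type L cells; 'ref' is the
    reference gene (Actb in the text). All inputs are threshold cycles.
    """
    delta_sample = ct_target_sample - ct_ref_sample      # normalize to reference
    delta_control = ct_target_control - ct_ref_control
    delta_delta = delta_sample - delta_control           # sample relative to control
    return 2.0 ** (-delta_delta)


# Hypothetical Ct values: target reaches threshold two cycles earlier in the sample.
fold = relative_expression(ct_target_sample=22.0, ct_ref_sample=18.0,
                           ct_target_control=24.0, ct_ref_control=18.0)
print(fold)  # 4.0
```

A two-cycle difference after normalization corresponds to a fourfold difference in transcript abundance under these assumptions.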


Loss of type H endothelium during ageing 

It has been previously reported that osteoblast numbers decline during
ageing. Due to reduced osteogenesis, bone quality and fracture healing
potential deteriorate with age, whereas fracture incidences become
more frequent. A remarkable decline in osteoprogenitors was seen in
long bone of ageing mice, accompanied by a significant decrease of
transcripts for mesenchymal cell and osteoprogenitor markers and the
loss of bone mass (Extended Data Fig. 6c-f). These changes correlated
with pronounced reduction of type H vessels, which were much more
abundant in juvenile (4-week-old) mice than in adult (11-week-old)
mice, and were nearly absent in aged (70-week-old) animals (Fig. 3b
and Extended Data Fig. 6c). Flow cytometry analysis confirmed this
age-dependent reduction of CD31(hi)Emcn(hi) endothelial cells, whereas
the total endothelial cell number in bone did not change significantly
(Fig. 3c and Extended Data Fig. 6g). Endothelial cell proliferation was
high within the type H subpopulation in juvenile mice and declined
rapidly in adulthood (Fig. 3d and Extended Data Fig. 6h). In contrast,
the rate of type L endothelial cell proliferation did not differ
significantly between juvenile and older animals (Fig. 3d). While flow
cytometry had suggested the potential existence of a liver CD31(hi)Emcn(hi)
endothelial cell subpopulation, these cells did not decline in adult and
aged mice (Extended Data Fig. 6i).

Figure 2 | Osteoprogenitor association with type H endothelial cells.
a, Confocal images of 4-week-old tibia with indicated stainings. Growth
plate (gp) and bone marrow cavity (bm) are marked. Osterix+ cells are found
around CD31(hi)Emcn(hi) vessels in metaphysis and endosteum (arrowheads).
b, Collagen 1α+ cells (green) surround columnar CD31+ vessels (red) but
not CD31− Emcn+ type L bone marrow (bm) sinusoids in 3-week-old
mouse tibia. c, Immunostained 4-week-old tibia showing association of
Runx2+ osteoprogenitors (green) with CD31+ (red) vessels in metaphysis and
endosteum (es). Minimum exposure was used to capture only cells with high


Type H endothelium mediates neo-angiogenesis in bone 


Genetic lineage tracing is a powerful approach for the investigation of
cell amplification, differentiation and migration processes. Single
injection of 4-week-old Cdh5(PAC)-CreERT2, Rosa26-mT/mG double
heterozygotes with a low dose of tamoxifen (see Methods) led to
selective green fluorescent protein (GFP) expression in metaphyseal
and endosteal type H endothelial cells as well as the endothelium of
CD31+ arteries (Fig. 4a and Extended Data Fig. 7a). The analysis of
mice at day 40 after tamoxifen administration (at 9.7 weeks of age)
revealed profound expansion of the GFP-positive endothelial cells,
which were abundant throughout the metaphysis and diaphysis (Fig. 4a
and Extended Data Fig. 7b, c). As the latter contains CD31(lo)Emcn(lo)
vessels, this finding established that type H endothelial cells can give
rise to type L endothelium. This, together with their high proliferative
capacity, suggested that type H endothelial cells are hierarchically
upstream of the majority of the bone sinusoidal (type L) endothelium.
Given the robust labelling of Cdh5(PAC)-CreERT2, Rosa26-mT/mG
double heterozygous arteries in this fate mapping approach, we cannot
resolve whether type H endothelial cells give rise to arteries in postnatal
bone. Arterial endothelial cells show low proliferative activity, arguing
against a significant contribution of arterial GFP+ cells to type L
capillaries.

Exposure to irradiation is known to induce regression of sinusoidal
endothelium, and a transient but substantial decline in endothelial cell
number. Flow cytometric quantification at 7 days post-irradiation
showed a significant increase in type H endothelial cells and strong
reduction of the type L subpopulation (Fig. 4b). Likewise, immunostaining
of tibia sections showed the presence of CD31(hi)Emcn(hi) endothelial
cells throughout the diaphysis (Fig. 4b). These findings are consistent
with a role for CD31(hi)Emcn(hi) vessels in developmental and regenerative
neo-angiogenesis in bone.


HIF role in angiogenesis and osteogenesis 

HIF-1α controls physiological and pathological neo-angiogenesis.
In addition to its role in the upregulation of VEGF-A expression in
hypoxic tissues, HIF-1α activity in endothelial cells is important for
wound healing and tumour angiogenesis in mice. To investigate HIF-1α
function in the postnatal bone endothelium, inducible
endothelial-specific loss-of-function mice (Hif1a(iΔEC)) were generated by
combining loxP-flanked Hif1a alleles (Hif1a(lox/lox); ref. 35) and Cdh5(PAC)-CreERT2
transgenics. Following tamoxifen administration from postnatal day
10 (P10) to P14, analysis of Hif1a(iΔEC) mutants at P20 revealed striking
vascular defects. Whereas the amount of type H endothelium was strongly
reduced in metaphysis and endosteum, the number of diaphyseal type L
vessels and endothelial cells was comparable to control littermates (Fig. 4c
and Extended Data Fig. 7d, e).

The von Hippel-Lindau (VHL) E3 ubiquitin protein ligase controls
the stability and thereby biological activity of HIF-1α and other
substrates. Inducible, endothelial-cell-specific targeting of the murine Vhl
gene with the same strategy as described above for Hif1a led to
pronounced expansion of type H endothelium and metaphyseal vessel
columns (Fig. 4d and Extended Data Fig. 7f). These changes were
accompanied by expansion of the metaphysis, increased formation of trabecular
bone and higher numbers of Runx2+ and Osterix+ osteoprogenitors in
Vhl(iΔEC) mutants (Fig. 5a, b and Extended Data Fig. 7g). Osteoprogenitors
were significantly reduced in Hif1a(iΔEC) samples (Fig. 5c, d). Likewise,
blood alkaline phosphatase levels (a biomarker for osteogenesis) were
elevated in Vhl(iΔEC) mutants and reduced in Hif1a(iΔEC) animals (Extended
Data Fig. 8a, b). Histomorphometric analysis and microtomography
(μ-CT) confirmed increased bone mass in Vhl(iΔEC) mutants without
detectable change in calcitonin receptor-positive osteoclasts (Extended
Data Fig. 8c-m). Parathyroid hormone and calcitonin, hormones that
regulate bone remodelling, and serum calcium and phosphate levels
were not significantly changed in Hif1a(iΔEC) or Vhl(iΔEC) mutants (data
not shown).

Figure 3 | Properties and age-dependent decline of type H endothelial cells.
a, qPCR analysis of growth factor expression (normalized to Actb) by
CD31(hi)Emcn(hi) endothelial cells relative to CD31(lo)Emcn(lo) ECs sorted from
murine tibia. Data represent mean ± s.e.m. (n = 3 biological replicates).
P values, two-tailed unpaired t-test. ND, not detectable. b, Confocal images of
CD31 (green) and Emcn (red) immunostained tibia sections at 4, 11 or
70 weeks. Note age-dependent decline of CD31(hi)Emcn(hi) endothelial cells in
metaphysis (top row) and endosteum (es, arrowheads). c, Flow cytometric
quantitation of CD31(hi)Emcn(hi) endothelial cells (ECs) from long bone of the
indicated age group. Total endothelial cells were not significantly changed
(bottom). Data represent mean ± s.e.m. (n = 7 mice from three independent
experiments). d, Fold change in EdU+ type H and type L endothelial cell
numbers in murine tibia determined by flow cytometry at indicated ages. Data
represent mean ± s.e.m. (n = 7 mice from two independent experiments).

To understand why endothelial-cell-specific deletion of Hif1a or
Vhl primarily affected type H vessels, we analysed HIF-1α expression
in 2-week-old bone, which corresponds to the time point of tamoxifen
administration. Strong HIF-1α immunostaining was seen in type H
but not type L endothelium (Extended Data Fig. 9a). Furthermore,
freshly isolated type H cells contained significantly higher Hif1a
transcript levels than type L endothelial cells (Extended Data Fig. 9b).
Transcripts encoding HIF-1β, which heterodimerizes with HIF-1α,
were not significantly different (data not shown). In endothelial cells
of adult and ageing animals, Hif1a mRNA was slightly reduced
(0.5-fold) but HIF-1α protein was no longer detectable in endothelial cells
(Extended Data Fig. 9c, d), which correlated with the age-dependent
loss of type H vessels. Conversely, loss of type L endothelial cells and
expansion of CD31(hi)Emcn(hi) endothelium after irradiation (Fig. 4b)
was accompanied by increased HIF-1α expression at both protein and
transcript levels (Extended Data Fig. 9e, f). These findings support a
key role of HIF-1α in the induction of type H endothelial cells.

Figure 4 | Type H endothelial cells mediate bone vessel growth. a, Maximum
intensity projection showing restricted GFP labelling (green) of arteries and
CD31(hi)Emcn(hi) vessels in 4-week-old Cdh5(PAC)-CreERT2, Rosa26-mT/mG
tibia at day 1 after tamoxifen administration (left). At day 40, GFP+ endothelial
cells were found throughout the metaphysis (mp) and diaphysis (dp,
arrowheads). Nuclei, DAPI (blue). b, Quantitation of type H and type L
endothelial cells in long bone at day 7 after sublethal irradiation. Data represent
mean ± s.e.m. (n = 16 mice, two independent experiments). P values,
two-tailed unpaired t-test. Bottom panels show representative images of control and
irradiated diaphysis. CD31(hi)Emcn(hi) endothelial cells (arrowheads) were found
throughout the diaphysis after irradiation. c, d, Maximum intensity projections
of immunostained, 3-week-old Hif1a(iΔEC) (c) or Vhl(iΔEC) tibiae (d) with
littermate controls. Note reduction of CD31(hi)Emcn(hi) endothelial cells
(arrowheads) in Hif1a(iΔEC) metaphysis (mp) and expansion in Vhl(iΔEC) samples.
Growth plate, gp; diaphysis, dp.
Prolyl-4-hydroxylases (PHDs) modify HIF-1α and thereby mark the
protein for degradation under normoxic conditions. Accordingly, PHD
inhibitors, such as deferoxamine mesylate (DFM), enhance HIF-1α
stability and activity. Treatment of isolated primary bone endothelial
cells with DFM led to upregulation of Pecam1 and Emcn transcripts,
and also increased the expression of growth factor messenger RNAs that
were enriched in freshly isolated CD31(hi)Emcn(hi) endothelial cells
(Fig. 5e). Next, we tested whether DFM promotes CD31(hi)Emcn(hi)
endothelial cells, neo-angiogenesis and osteogenesis in aged animals. Although
long bones of aged, 64- to 70-week-old mice treated with vehicle
control contained very few CD31(hi)Emcn(hi) vessels, DFM administration (see
Methods) led to substantial expansion of type H endothelium (Fig. 5f
and Extended Data Fig. 10a) and emergence of vessel-associated Osterix+
cells (Fig. 5g and Extended Data Fig. 10b-d). Expression levels of
osteoprogenitor and osteoblast markers were significantly increased in long
bone and calvarium of DFM-treated animals (Fig. 5h and Extended
Data Fig. 10e). Furthermore, μ-CT examination showed that 6 weeks
of DFM treatment led to significantly increased bone mass (Fig. 5i, j).
Although the activity of DFM is not restricted to endothelial cells and
is likely to affect multiple cell populations, the findings above argue
for crucial roles of endothelial HIF in controlling bone angiogenesis,
type H vessel abundance, endothelial growth factor expression and
osteogenesis.

Figure 5 | Type H endothelial cells couple angiogenesis and osteogenesis.
a, Confocal images of CD31 (green), Endomucin (red) and Osterix (white)
immunostained, 3-week-old Vhl(iΔEC) and control tibiae. b, Quantitation of
Runx2+ and Osterix+ (Osx) cells in Vhl(iΔEC) mutants and controls. Data
represent mean ± s.e.m. (n = 5 mice in two independent experiments). P values,
two-tailed unpaired t-test. c, Maximum intensity projections of 3-week-old
Hif1a(iΔEC) and control tibia stained for CD31 (green), Endomucin (red) and
Osterix (white). Growth plate, gp. d, Quantitation of Runx2+ and Osterix+ cells
in Hif1a(iΔEC) mutant and control long bone. Data represent mean ± s.e.m.
(n = 5 mice in three independent experiments). P values, two-tailed unpaired
t-test. e, qPCR mRNA analysis of DFM or vehicle-treated bone endothelial cells
(BECs). Data represent mean ± s.e.m. (n = 3 biological replicates). Note
induction of growth factor expression by DFM. f, g, Representative images of
CD31 (red, f, g) and Osterix (green, g) stained tibia sections from aged
DFM-treated and control mice. Low intensity projection shows only CD31(hi) cells.
DFM induces CD31(hi) vessels and Osterix+ osteoprogenitors. Chondrocytes, ch.
h, qPCR analysis of Sp7 (encoding Osterix), Bglap (bone
gamma-carboxyglutamate protein) and Ibsp (integrin-binding sialoprotein) mRNA
expression relative to Actb in the aged DFM-treated or vehicle control femur.
Data represent mean ± s.e.m. (n = 6-8 mice in four independent experiments).
P values, two-tailed unpaired t-test. i, Representative μ-CT images of aged
DFM-treated and control tibiae. j, Quantitative μ-CT analysis of relative bone
volume (Rel. BV; bone volume/total volume), trabecular number (Tb. nb.), and
trabecular thickness (Tb. thickness) in proximal tibia. Data represent
mean ± s.e.m. (n = 5 mice in two independent experiments). P values,
two-tailed unpaired t-test.


Discussion 


The mammalian skeleton sustains stem and progenitor cell populations
allowing lifelong haematopoiesis and osteogenesis. These processes
rely on niche microenvironments for which a number of critical cell
types and molecular factors have been identified. In the case of bone
formation, mesenchymal stem cells generate new osteoblasts and their
progenitors, which are essential for bone homeostasis and fracture
healing. Blood vessel growth and the invasion of osteoprogenitors
appear coupled in bone fractures but the communication between
endothelial and osteoblastic cells remains poorly understood. Likewise,
the exact architecture of the bone vasculature or the processes
mediating its growth had remained unknown. Some of these key questions
are now resolved in this manuscript. The finding that capillaries in the
skeletal system of mice can be subdivided into type H and type L
endothelium on the basis of morphological, molecular and functional
criteria should be very useful for future studies in basic and medical
research. CD31(hi)Emcn(hi) capillaries at the distal end of the arterial
network in bone might represent the central building block of a
metabolically specialized tissue environment with privileged access to
oxygen and nutrients, which probably influences the growth potential and
metabolism of other cell types. This is not only relevant for osteoblastic
cells but potentially also for haematopoietic stem and progenitor cells,
which preferentially home to the metaphysis after transplantation.
However, there is also evidence that the bulk of haematopoietic stem
cells are exposed to low oxygen, whereas committed and differentiating
haematopoietic progenitors reside in relatively well-oxygenated
environments. The latter is consistent with our recent description of the
hemosphere, a specialized microenvironment found in the mouse
metaphysis that enables the clonal expansion of haematopoietic cells.

We also propose that type H endothelial cells mediate local growth
of the vasculature and provide niche signals for perivascular
osteoprogenitors. Type H vessel formation and the expression of potential
angiocrine factors for osteoblastic cells are enhanced by HIF and, as we show
in the accompanying manuscript, by Notch signalling. Thus, the abundance
of CD31(hi)Emcn(hi) endothelial cells may be useful as a diagnostic
readout for the growth status of the bone vasculature and its pro-osteogenic
capacity. Our results also indicate that specific molecular pathways can
be used to boost type H vessel formation and osteogenesis (Extended
Data Fig. 1b). This might be of great importance for conditions
involving compromised fracture healing or loss of bone mass. Ageing and
post-menopausal oestrogen deficiency are major risk factors for
osteoporosis, and oestrogen can promote angiogenesis. Accordingly,
decline of type H vessels and the concomitant reduction of
osteoprogenitor cells could potentially offer a compelling explanation for the
loss of bone mass during ageing and might enable therapeutic
improvement of osteogenesis in elderly people.


METHODS SUMMARY 

Genetically modified and aged mice. C57BL/6J males were used for experiments
at the age of 2-4 weeks (juvenile), 8-12 weeks (adult) and 57-70 weeks (aged).
Endothelial-cell-specific gene deletions were generated using Cdh5(PAC)-CreERT2
transgenic mice, which were interbred with conditional mutants carrying
loxP-flanked Hif1a (Hif1a(lox/lox)). To induce Cre activity and gene inactivation, pups
were injected with 500 μg tamoxifen (Sigma, T5648) intraperitoneally (i.p.) every
day from P10 to P14. Femurs and tibiae from Cdh5(PAC)-CreERT2(T/+) Hif1a(lox/lox)
(Hif1a(iΔEC)) mutants and Cre-negative (Hif1a(lox/lox)) controls were collected on P20
or P37 after euthanasia. The same approach was used for experiments involving
conditional Vhl mutant mice. For genetic labelling of metaphyseal vasculature,
Cdh5(PAC)-CreERT2(T/+) mice were mated with R26-mG/mT reporters. At P29,
Cdh5(PAC)-CreERT2(T/+)/R26-mG/mT(T/+) mice received a single dose of 50 μg
tamoxifen and were analysed either 1 or 40 days later.

All animals were genotyped by PCR. Protocols and primer sequences are pro- 
vided upon request. When indicated, adult wild-type mice were whole-body irra- 
diated with a single dose of 900 rads (Gammacell irradiator) and euthanized 7 days 
later. 

For DFM treatment, freshly prepared deferoxamine mesylate (Sigma) in water
(15 mg ml⁻¹ per mouse) was injected i.p. every other day for 4 weeks or, for μ-CT
analysis, 5 weeks. Control animals received the same amount of sterile water.

For labelling of proliferating cells, mice were intraperitoneally injected with
1.6 mg per kg weight of EdU (Invitrogen) 2 h before euthanasia. Tibiae were
immediately collected and processed. Bone marrow cells and bone sections were
stained for EdU using Click-iT chemistry following the manufacturer's
instructions (Invitrogen).
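
Weight-based doses such as the EdU dose above translate into a per-animal amount by simple arithmetic. The sketch below shows the conversion; the 20 g body weight is a hypothetical example value, not a figure from the study, and the helper name is illustrative.

```python
def dose_per_mouse_ug(dose_mg_per_kg, body_weight_g):
    """Convert a mg-per-kg dose into micrograms for one animal."""
    if body_weight_g <= 0:
        raise ValueError("body weight must be positive")
    body_weight_kg = body_weight_g / 1000.0
    dose_mg = dose_mg_per_kg * body_weight_kg
    return dose_mg * 1000.0  # mg -> micrograms


# EdU at 1.6 mg per kg for a hypothetical 20 g mouse.
edu_ug = dose_per_mouse_ug(1.6, 20.0)
print(f"EdU per mouse: {edu_ug:.1f} ug")
```

For the assumed 20 g animal this gives about 32 μg of EdU; the same helper applies to any other dose stated per kilogram of body weight.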

Experiments involving animals were performed according to the institutional
guidelines and laws, following protocols approved by local animal ethics committees.

METHODS

Bone immunohistochemistry. Freshly dissected bone tissues collected from
wild-type mice or from mutants and their control littermates were immediately fixed in
ice-cold 4% paraformaldehyde solution for 4 h. Decalcification was carried out
with 0.5 M EDTA at 4 °C with constant shaking and decalcified bones were immersed
in 20% sucrose and 2% polyvinylpyrrolidone (PVP) solution for 24 h. Finally, the
tissues were embedded and frozen in 8% gelatin (porcine) in the presence of 20%
sucrose and 2% PVP. For immunofluorescent stainings and morphological
analyses, sections were generated using low-profile blades on a Leica CM3050 cryostat.

For phenotypic analysis, mutant and littermate control samples were always
processed, sectioned, stained, imaged and analysed together under the same
conditions and settings. For immunostaining, bone sections were air-dried,
permeabilized for 10 min in 0.3% Triton X-100, blocked in 5% donkey serum at room
temperature for 30 min, and probed with the primary antibodies diluted in 5%
donkey serum in PBS for 2 h at room temperature (RT) or overnight at 4 °C.

The following primary antibodies were used: Endomucin (sc-65495, Santa Cruz,
diluted 1:100), Pecam1 conjugated to Alexa Fluor 488 (FAB3628G, R&D Systems,
1:100), Pecam1 (553370, BD Pharmingen, 1:100), Osterix (sc-22536-R, Santa Cruz,
1:200), Runx2 (MAB2006, R&D Systems, 1:200), α-SMA-Cy3 (C6198, Sigma, 1:100),
HIF-1α (ab65979, Abcam, 1:100), Collagen type 1 (AB675P, Millipore, 1:200),
Osteopontin (AF808, R&D Systems, 1:200), Mct4 (sc-50329, Santa Cruz, 1:100),
Glut1 (07-1401, Millipore, 1:100), phospho-p44/42 MAPK (4370, Cell Signaling,
1:75), Calcitonin receptor (ab11042, Abcam, 1:75) and Biotin-conjugated CD45 (553077,
Becton Dickinson, 1:100). After primary antibody
incubation, sections were washed three times with PBS and incubated with
appropriate Alexa Fluor-coupled secondary antibodies (1:400, Molecular Probes)
for 1 h at room temperature. Nuclei were counterstained with DAPI. Sections were
thoroughly washed with PBS before mounting them using FluoroMount-G (Southern
Biotech). Finally, coverslips were sealed with nail polish.

For metabolic labelling with the hypoxia probe pimonidazole (Pimo, Hypoxyprobe
Inc.), mutant and control mice were intraperitoneally injected with 60 mg per kg Pimo
2 h before euthanasia. Metabolized Pimo was detected by a rabbit antiserum against
the non-oxidized, protein-conjugated form of pimonidazole (Hypoxyprobe Inc.).
Image acquisition and quantitative analysis. Immunofluorescent stainings were 
analysed at high resolution with a Zeiss laser scanning confocal microscope, LSM- 
780. Z-stacks of images were processed and 3D-reconstructed with Imaris soft- 
ware (version 7.00, Bitplane). Imaris, Photoshop and Illustrator (Adobe) software 
was used for image processing in compliance with Nature’s guide for digital images. 
All quantifications were done with ImageJ, Imaris and Volocity software on high- 
resolution confocal images. 
Statistical analysis. All data are presented as mean ± s.e.m. The significance of
difference in the mean values was determined using two-tailed Student’s t test
unless indicated otherwise. P < 0.05 was considered significant. All statistical
analyses were performed using GraphPad Prism software. No randomization or
blinding was used and no animals were excluded from analysis. Sample sizes were
selected on the basis of previous experiments. Several independent experiments 
were performed to guarantee reproducibility of findings. 
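The summary statistics described above can be illustrated with a minimal sketch. It assumes the classical equal-variance form of the unpaired Student's t test, computes the P value by numerically integrating the t density, and uses hypothetical measurements; it is not the GraphPad Prism procedure used in the study, and all function names are illustrative.

```python
import math
from statistics import mean, stdev, variance


def mean_sem(values):
    """Return (mean, s.e.m.) using the sample standard deviation."""
    return mean(values), stdev(values) / math.sqrt(len(values))


def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    coeff = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coeff * (1 + x * x / df) ** (-(df + 1) / 2)


def two_tailed_p(t, df, steps=10000):
    """Two-tailed P value via Simpson integration of the density over [0, |t|]."""
    upper = abs(t)
    if upper == 0:
        return 1.0
    h = upper / steps  # steps must be even for Simpson's rule
    total = t_pdf(0.0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    central_area = 2 * total * h / 3  # probability mass inside [-|t|, |t|]
    return max(0.0, 1.0 - central_area)


def student_t_test(a, b):
    """Unpaired two-tailed Student's t test assuming equal variances."""
    na, nb = len(a), len(b)
    df = na + nb - 2
    pooled = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / df
    t = (mean(a) - mean(b)) / math.sqrt(pooled * (1 / na + 1 / nb))
    return t, df, two_tailed_p(t, df)


# Hypothetical measurements (arbitrary units), not data from the study.
control = [1.0, 2.0, 3.0, 4.0, 5.0]
mutant = [2.0, 3.0, 4.0, 5.0, 6.0]
t, df, p = student_t_test(control, mutant)
print(mean_sem(control), t, df, p, p < 0.05)
```

With these made-up values the difference would not count as significant under the P < 0.05 criterion.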




Culture of endothelial cells from bone. Tibiae and femurs from wild-type mice or
mice expressing endothelial-specific reporters (that is, Flk1-GFP*' and Cdh5(PAC)-
CreERT2"* /R26-mG/mT “ *) were collected in sterile Ca2+ and Mg2+ free PBS,
crushed with mortar and pestle, and digested with collagenase A (Sigma) to obtain
a single-cell suspension. Endothelial cells were then MACS sorted using Endomucin
antibody (cat. no. SC-65495) and Dynabeads sheep anti-Rat IgG (Invitrogen). Sorted
endothelial cells were then plated on dishes coated with fibronectin and cultured in
endothelial cell growth medium (EBM-2, Clonetics; Lonza) supplemented with
EGM-2 SingleQuots (CC-4176, Clonetics; Lonza). At first passage, cells were again
MACS sorted with Endomucin antibody and plated for culture. Cells were fed
every third day and passaged upon confluency. Cultures were maintained at 37 °C
with 5% CO2 in a humidified atmosphere. For DFM treatment and subsequent
qPCR analysis, endothelial cell cultures between passage 2 and 5 were used. Cells
were treated with DFM (6.25 mg ml−1 of culture medium) for 7 h
and subsequently lysed in lysis buffer of RNeasy Mini Kit (Qiagen) for qPCR analysis.
Quantitative RT-PCR. For the analysis of mRNA expression levels in type H or
type L endothelium, CD31hiEmcnhi and CD31loEmcnlo cells were sorted by FACS
directly into the lysis buffer of the RNeasy Mini Kit (Qiagen). Total RNA was
isolated according to the manufacturer’s protocol. A total of 100 ng RNA per reaction
was used to generate cDNA with the iScript cDNA Synthesis System (Bio-Rad).
Quantitative PCR (qPCR) was performed using TaqMan gene expression assays
on an ABI PRISM 7900HT Sequence Detection System. The FAM-conjugated TaqMan
probes Tgfb1, Tgfb3, Fgf1, Pdgfa and Pdgfb were used along with TaqMan Gene
Expression Master Mix (Applied Biosystems). Gene expression assays were
normalized to endogenous VIC-conjugated Actb probes as standard. For analysis of
mRNA expression levels from whole bones, dissected femurs were immediately
crushed finely, digested with collagenase and centrifuged to obtain a pellet, which was
then lysed in lysis buffer of RNeasy Mini Kit (Qiagen). A total of 500 ng RNA per
reaction was used to generate cDNA with the iScript cDNA Synthesis System
(Bio-Rad), which was further processed as described above. FAM-conjugated TaqMan
probes Ibsp, Bglap, Cspg4, Pdgfrb, Runx2 and Sp7 were used along with TaqMan
Gene Expression Master Mix (Applied Biosystems) to perform qPCR. For the
mRNA expression analysis of flushed bone and bone collar/endosteum,
FAM-conjugated TaqMan probes Tgfb1, Tgfb3, Fgf1, Pdgfa and Pdgfb were used. Gene
expression was normalized to the endogenous VIC-conjugated Actb probes.
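The text states only that expression was normalized to Actb. One common way to turn such a normalization into a relative fold change is the comparative Ct (2^-ΔΔCt) calculation, sketched below as an assumption rather than as the authors' documented procedure. The function name and the Ct values are hypothetical, and the formula presumes roughly 100% amplification efficiency for both probes.

```python
def relative_expression(ct_target, ct_reference, ct_target_control, ct_reference_control):
    """Fold change of a target gene versus a control sample (comparative Ct method).

    Each Ct is first normalized to the reference gene (here, Actb) measured in
    the same sample; the control sample's normalized value is then subtracted.
    """
    delta_ct_sample = ct_target - ct_reference
    delta_ct_control = ct_target_control - ct_reference_control
    delta_delta_ct = delta_ct_sample - delta_ct_control
    return 2.0 ** (-delta_delta_ct)


# Hypothetical Ct values: target gene at 24 cycles and Actb at 18 in the sample,
# target gene at 26 cycles and Actb at 18 in the control.
fold = relative_expression(24.0, 18.0, 26.0, 18.0)
print(fold)
```

A target that crosses threshold two cycles earlier than in the control, at equal Actb signal, corresponds to a fourfold higher relative expression.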
Flow cytometry. For flow cytometric analysis and sorting of type H and type L 
endothelial cells, tibiae and femurs were collected and cleaned thoroughly to remove 
the adherent muscles. The epiphysis was removed and only the metaphysis and 
diaphysis regions were processed. Tibiae were then crushed in ice-cold PBS with
mortar and pestle. Whole bone marrow was digested with collagenase
at 37 °C for 20 min. Equal numbers of cells were then subjected to immunostaining
with endomucin antibody (Santa Cruz, sc-65495) for 45 min. After washing, cells 
were stained with APC-conjugated CD31 antibody (R&D Systems, FAB3628A) for 
45 min. After washing, cells were acquired on BD FACS Canto flow cytometer and 
analysed using BD FACSDiva software (Version 6.0, BD Bioscience). Cell sorting 
was performed with FACS Aria II. 

For demarcating and sorting CD31hiEmcnhi endothelial cells, standard
quadrant gates were set first. Subsequently, to differentiate CD31hiEmcnhi cells from
the total double-positive cells in quadrant 2, gates were arbitrarily set at >10⁴ log
Fl-4 (CD31-APC) fluorescence and >10⁴ log Fl-2 (Endomucin-PE) fluorescence.

For the analysis of total ECs in bone, tibiae were processed as described above
to obtain single cell suspensions, which were stained with biotin-coupled CD45
(BD, 553077) or Ter119 (BD, 559971) antibodies for 45 min. After washing in
PBS, cells were stained with Streptavidin PE-Cy5 (BD, 554062) and Alexa Fluor488-
conjugated CD31 (R&D Systems, FAB3628G) antibodies for 45 min. After washing,
cells were acquired on a FACS Canto flow cytometer and analysed using FACSDiva
software (Version 6.0, BD Bioscience). Total bone ECs were quantified as
CD31+CD45−Ter119−. Endomucin was used to distinguish Emcn-negative arterial
endothelial cells from Emcn+ sinusoidal and venous cells.
Micro-CT analysis and histomorphometry. Tibiae were collected, the attached
soft tissue was thoroughly removed, and the bones were fixed in 4% paraformaldehyde. The fixed
tibiae were analysed using micro-CT (μCT 35) and software IPL V5.15 at Scanco
Medical AG, Switzerland. A voxel size of 12 μm was chosen in all three spatial
dimensions. For each sample, 148 of 232 slices were evaluated, covering a total of
1.776 mm at a voltage of 70 kVp, intensity 114 μA, integration time 1200 ms.
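The evaluated length follows directly from the slice count and the voxel size. The short check below, with an illustrative function name, reproduces the 1.776 mm figure from 148 slices at a 12 μm slice thickness.

```python
def scanned_length_mm(n_slices, voxel_size_um):
    """Length covered along the scan axis, in millimetres."""
    return n_slices * voxel_size_um / 1000.0


# 148 evaluated slices at an isotropic voxel size of 12 micrometres.
length_mm = scanned_length_mm(148, 12)
print(length_mm)
```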

For calcein double labelling, mice were given intraperitoneal injections of 10 mg
per kg calcein (Sigma, C0875) dissolved in 2% sodium bicarbonate solution at
day 10 and day 3 before euthanasia. Undecalcified bones were fixed in 4% PFA,
embedded in 8% gelatin and 2% PVP and cryosectioned. Sections were stained
with the von Kossa method to assess mineralized bone.

Osteoclast surface/bone surface (Oc. S/BS; %) and osteoclast number/bone perimeter 
(No. Oc./B. Pm) were calculated based on Calcitonin receptor staining of bone sections. 
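The two osteoclast parameters are simple ratios, sketched below. The function names, the millimetre units and the example measurements are assumptions for illustration; the text does not specify how surfaces and perimeters were traced.

```python
def osteoclast_surface_percent(osteoclast_surface_mm, bone_surface_mm):
    """Oc.S/BS: share of the bone surface covered by osteoclasts, in percent."""
    return 100.0 * osteoclast_surface_mm / bone_surface_mm


def osteoclasts_per_perimeter(n_osteoclasts, bone_perimeter_mm):
    """No.Oc./B.Pm: osteoclast count per unit of bone perimeter (per mm here)."""
    return n_osteoclasts / bone_perimeter_mm


# Hypothetical measurements from one section, not data from the study.
print(osteoclast_surface_percent(0.5, 4.0))
print(osteoclasts_per_perimeter(12, 4.0))
```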


51. Xu, Y. et al. Neuropilin-2 mediates VEGF-C-induced lymphatic sprouting together
with VEGFR3. J. Cell Biol. 188, 115-130 (2010). 


©2014 Macmillan Publishers Limited. All rights reserved 


[Figure image not reproduced. Recoverable panel labels: Juvenile/Adult; Aged; metaphysis; diaphysis; Type H endothelium; DFM-treated.]

Extended Data Figure 1 | Schematic representation of key findings. a, Type
H vasculature (red) in the metaphysis (mp) and endosteum (es) represents a
functionally specialized vessel subtype that mediates vessel growth and
promotes osteogenesis. The latter is presumably mediated by angiocrine growth
factors (small circles). In aged animals (right), the number of type H vessels and
associated osteoprogenitors (OPs) is strongly reduced so that bone mainly
contains type L, sinusoidal vessels characteristic for the diaphyseal (dp) marrow
cavity. Arrows indicate the incoming arterial flow and venous drainage.
b, Regulation of type H endothelium by hypoxia-inducible factor. Endothelial-
cell-specific gene inactivation of HIF-1α led to pronounced reduction of type H
endothelial cells and osteoprogenitors, whereas the opposite effect was obtained
by disrupting endothelial VHL expression. Type H endothelial cells, associated
osteoprogenitor cells and bone formation were stimulated by DFM in
aged mice.
Extended Data Figure 2 | Regional differences in metabolic marker
expression. a, Tile scan confocal images showing maximum intensity surface
projection of Emcn (red) immunostaining on tibial bone section. Nuclei in left
image are stained with DAPI (blue). Arrowheads mark the exit of the vein
through the cortical bone (cb). Indicated are growth plate (gp), diaphysis (dp),
metaphysis (mp) and chondrocytes (ch). b, Schematic representation of
proximal tibial bone indicating localization of different regions: secondary
ossification centre (soc), growth plate (gp), metaphysis (mp), diaphysis (dp),
endosteum (es). c, Representative confocal images showing pimonidazole
(green) staining on a tibial section from a 5-week-old mouse. Nuclei, DAPI
(blue); ECs, Endomucin (red, Emcn). Note abundance of pimonidazole
staining throughout the diaphysis (dp) but not in the metaphyseal (mp) region.
Dashed lines indicate the borders of cortical bone and growth plate (gp).
d, e, Maximum intensity projections of pimonidazole (green) stained 8-week-
old tibia. Nuclei, DAPI (blue); CD31 (red, d); CD45 (red, e). Green staining is
seen in CD45+ haematopoietic cells in the diaphysis (dp) and on the bone
surface (arrowheads). Dashed lines indicate the borders of cortical bone (cb)
and growth plate (gp).
Extended Data Figure 3 | Regional differences in metabolic marker
expression. a, Representative confocal images showing HIF1-α (green) and
CD31 (red) immunostaining on sections of 7-week-old tibiae. Nuclei, DAPI
(blue). Note abundance of HIF1-α-positive nuclei in the diaphysis (dp) and
secondary ossification centre (soc) but not in the metaphyseal (mp) region near
the growth plate (gp). Dashed lines indicate borders of the growth plate
(top and centre) or, in panels on the bottom, the endosteum (es). b, Maximum
intensity projections of tibial sections from 7-week-old mice showing
immunostaining for the indicated markers. MCT4-positive (green) cells were
absent in the metaphysis (mp) but abundant in the diaphysis (dp) and
secondary ossification centre (soc). c, Maximum intensity projections of Glut1
(green) immunostaining on sections of 7-week-old tibiae. Nuclei, DAPI (blue).
Note abundance of Glut1-positive cells in the diaphysis (dp) but not in the
metaphyseal (mp) region below the growth plate (gp). Dashed lines indicate
borders of growth plate or endosteum (es), respectively. d, Quantitation of
HIF1-α, MCT4 and Glut1 immunostaining intensities in metaphysis and
diaphysis. Data represent mean ± s.e.m. (n = 5 mice in two independent
experiments). P values, two-tailed unpaired t-test. e, Representative tile scan
confocal image showing the distribution of phospho-ERK1/2 (green)
immunosignal in sections of 7-week-old tibia. Nuclei, DAPI (blue). Note
prominent phospho-ERK1/2 staining of cells in the metaphysis (mp) relative to
the diaphysis (dp). Dashed line indicates border of the growth plate (gp).
Extended Data Figure 4 | Structural and marker heterogeneity in bone
sinusoidal endothelium. a, Representative tile scan confocal image showing
maximum intensity surface projection of Endomucin (red) immunostaining of
ECs in the femur of a 2-week-old mouse. Differences in Endomucin staining
intensity are lost in this projection. Nuclei in left image are stained with DAPI
(blue). Dashed lines indicate the adjacent growth plate (top), the border
of the diaphysis (dp) and the morphologically distinct metaphyseal (mp)
and diaphyseal (dp) vessels. b, Tile scan confocal image showing maximum
intensity surface projection of GFP+ ECs in 2-week-old Flk1-GFP transgenic
tibia. Left: nuclei, DAPI (blue). Dashed lines indicate the adjacent growth plate
(top) or the border of the diaphysis (dp). Dashed lines mark borders of the
growth plate (top) and cortical bone (left and right) as well as the interface
between column-like metaphyseal (mp) vessels and the highly branched
diaphyseal (dp) vasculature. c-e, Representative tile scan confocal images of the
GFP+ (green) endothelium in Cdh5(PAC)-CreERT2, Rosa26-mT/mG double
transgenic tibiae from 2-week-old (c), 4-week-old (d), or 6-week-old (e) mice
after postnatal tamoxifen administration. Nuclei, DAPI (blue). GFP signal is
restricted to vessels and absent in chondrocytes of the growth plate (gp) or
haematopoietic cells. Dashed lines indicate the borders of the growth plate,
cortical bone and secondary ossification centre (soc). f, Quantitative analysis
of relative CD31 and Endomucin immunostaining intensities in the
microvasculature of the metaphysis, diaphysis (marrow cavity) and endosteum,
as indicated. Data represent mean ± s.e.m. (n = 7 mice from seven
independent experiments). P values, two-tailed unpaired t-test. g, Mean
fluorescence intensities (MFI) of CD31hiEmcnhi and CD31loEmcnlo endothelial
subsets as determined by flow cytometric analysis of bone marrow cells stained
with CD31 and Endomucin. Data represent mean ± s.e.m. (n = 7 mice from two
independent experiments). P values, two-tailed unpaired t-test.
Extended Data Figure 5 | EC subsets in different skeletal elements and
organs. a-d, Representative tile scan confocal images showing CD31 (green)
and Endomucin (red) immunostaining in juvenile (4-week-old) vertebra
(a), sternum (b), and whole-mount (c) or sectioned (d) calvarium (parietal
bone). Nuclei, DAPI (blue). Arrowheads indicate CD31hiEmcnhi endothelium
(yellow). Arrow in a marks an adjacent artery (green). e, Representative dot
plots showing flow cytometric analysis of CD31 and Endomucin-stained single
cell suspensions from kidney, heart, spleen, lung, brain and liver. Note absence
of a CD31hiEmcnhi EC subset (orange dashed circle in Q2) in these organs
with exception of liver.


[Figure image not reproduced. Recoverable panel and axis labels: Flushed marrow; Flushed tibia; Endosteal; Relative fold mRNA expression; juvenile (5 weeks); aged (70 weeks); Metaphysis; 3 weeks; Endomucin; Flk1-GFP/EdU; juvenile liver.]


Extended Data Figure 6 | Regional and age related differences in ECs and
bone. a, Confocal images showing diaphyseal region of flushed tibia
immunostained for CD31 (green) and Endomucin (red). Nuclei, DAPI (blue).
Note retention of type H endothelium in the endosteum after flushing.
b, Comparative qPCR analysis of marrow cell suspension flushed from the
tibial diaphysis and compact bone plus endosteum (harbouring type H
endothelium). Shown are expression levels of Pdgfa, Pdgfb, Fgf1, Tgfb1 and
Tgfb3 mRNAs relative to mRNA for β-actin. Data represent mean ± s.e.m.
(n = 6-8 mice in two independent experiments). P values, two-tailed unpaired
t-test. c, Representative confocal images from metaphyseal and diaphyseal
regions of tibias from mice of different ages immunostained for Osterix (green)
and CD31 (red). Nuclei, DAPI (blue). Dashed lines mark the adjacent growth
plate (metaphysis) or endosteum (es) in diaphysis. Note striking decline of
CD31hi vessels and associated osteoprogenitors in ageing mice. d, Quantitative
mRNA expression analysis of Cspg4, Pdgfrb, Runx2 and Sp7 relative to
transcripts encoding β-actin in long bones from juvenile and aged mice. Note
significant decline of all 4 markers in bone from aged mice. Data represent
mean ± s.e.m. (n = 7 mice in two independent experiments). P values,
two-tailed unpaired t-test. e, Representative μ-CT images of tibias from juvenile
(5-week-old) and aged mice. Note significant loss of bone in aged mice.
f, Quantitative μ-CT analysis of relative bone volume (bone volume/total
volume), number of trabeculae and trabecular separation (that is, space
between trabeculae) in proximal tibias from juvenile mice and aged mice. Data
represent mean ± s.e.m. (n = 5 mice in two independent experiments).
P values, two-tailed unpaired t-test. g, FACS plots of CD31 and Endomucin
double stained single cell suspensions from murine tibias. CD31hiEmcnhi ECs
decline with age. h, Confocal images showing type H endothelium identified as
GFP+ (red) ECs and proliferation (EdU incorporation, green) in the
metaphysis (mp, upper panel) or diaphysis (dp, lower panel) from 3- or 8-week-
old Flk1-GFP transgenic tibia, as indicated. EdU+ proliferating GFP+ cells
(arrowheads), which represent type H endothelium, are abundant in the
metaphysis (mp) of juvenile but not adult mice. EdU+GFP+ ECs are
sparse in sinusoidal (type L) vessels of the diaphysis. i, FACS plots showing
CD31 and endomucin double staining of single cell suspensions from juvenile,
adult and aged mice livers. CD31hiEmcnhi ECs in liver do not decline with age.
Extended Data Figure 7 | Lineage tracing of type H endothelium and
analysis of HIF pathway mutants. a, b, Representative confocal images of
sectioned tibiae from the genetic lineage tracing experiment analysed at 1 and
40 days after tamoxifen administration. GFP-labelled endothelial cells (green)
seen in Cdh5(PAC)-CreERT2, Rosa26-mT/mG double transgenics after 1 day
correspond to arteries and type H endothelium (arrowheads) in the metaphysis
(mp) and endosteum, but not in sinusoidal vessels of the diaphysis (dp).
Counterstaining of ECs with anti-Endomucin antibody (red fluorescence) in
samples taken after 40 days shows expansion of the GFP+ endothelium into the
diaphyseal microvasculature. Nuclei in small insets in (a), DAPI (blue). Dashed
lines indicate border of growth plate (gp) and outline of compact bone.
c, Representative confocal images of metaphyseal (mp) region in tibiae from the
genetic lineage tracing experiment analysed at day 40 after tamoxifen
administration showing GFP-labelled ECs (green) in Cdh5(PAC)-CreERT2,
Rosa26-mT/mG double transgenics and Osterix staining (white). Nuclei stained
with DAPI (blue). Dashed lines drawn below chondrocytes (ch) indicate
border of growth plate (gp). d, e, Quantitative analysis of CD31hiEmcnhi
endothelial cells in long bone from Hif1aiΔEC and corresponding littermate
controls analysed at postnatal day 20 (P20, d) or P37 (e). Shown is fold change
in frequency of CD31hiEmcnhi ECs identified by flow
cytometry. Data represent mean ± s.e.m. (n = 7 mice from three independent
experiments (d); n = 6 mice from three independent experiments (e)). P values,
two-tailed unpaired t-test. f, Quantitative analysis of CD31hiEmcnhi endothelial
cells in long bone from VhliΔEC and corresponding littermate controls
analysed at postnatal day 20 (P20). Shown is fold change in frequency of
CD31hiEmcnhi endothelial cells identified by flow cytometry.
Data represent mean ± s.e.m. (n = 5 mice from three independent
experiments). P values, two-tailed unpaired t-test. g, Representative tile scan
confocal images from tibia sections of control and VhliΔEC mutants
immunostained for Endomucin (red) and Osteopontin (green). Nuclei, DAPI
(blue). Note widespread osteopontin staining in VhliΔEC mutant tibia.
gp, growth plate; mp, metaphysis; dp, diaphysis; es, endosteum.
Extended Data Figure 8 | VhliΔEC mutants show increased bone mass.
a, b, Serum alkaline phosphatase levels in Hif1aiΔEC (a) and VhliΔEC
(b) mutants. Data represent mean ± s.e.m. (n = 5 or 6 mice for Hif1aiΔEC
from three independent experiments; n = 6 mice for VhliΔEC from three
independent experiments). P values, two-tailed unpaired t-test.
c, Representative μ-CT images of tibias from VhliΔEC mutants and littermate
controls. d-g, Quantitative μ-CT analysis of relative bone volume (bone
volume/total volume; d), trabecular thickness (e), trabecular number (f) and
trabecular separation (g) in proximal tibia from VhliΔEC mutants and their
littermate controls. Data represent mean ± s.e.m. (n = 4 mice from two
independent experiments). P values, two-tailed unpaired t-test. Note increased
bone mass in VhliΔEC mutants. h, Calcein double labelling of 5-week-old
VhliΔEC mutant and littermate control tibiae. i, j, Quantitative analysis of bone
formation parameters. Mineral apposition rate (MAR; i) and bone formation
rate/bone surface (BFR/BS; j) for VhliΔEC mutants and controls. Data represent
mean ± s.e.m. (n = 6 or 7 mice from three independent experiments). P values,
two-tailed unpaired t-test. k, Representative confocal images showing
Calcitonin receptor staining (osteoclasts) in tibia sections from VhliΔEC
mutants and littermate controls. Nuclei, DAPI (blue). l, m, Histomorphometric
analysis of VhliΔEC and control tibiae showing osteoclast number/bone
perimeter (No. Oc./B. Pm; l) and osteoclast surface/bone surface (Oc. S/BS;
m). Data represent mean ± s.e.m. (n = 6 mice from three independent
experiments). P values, two-tailed unpaired t-test.
Extended Data Figure 9 | Age-dependent endothelial HIF1-α expression.
a, Representative confocal images showing HIF1-α (green) and CD31 (red)
immunostaining on sections of 2-week-old tibia. Nuclei, DAPI (blue). Note
abundance of HIF1-α-positive type H ECs in 2-week-old metaphysis (mp) but
not in the type L endothelium in diaphysis (dp). Dashed line marks border of
growth plate (gp). b, Quantitative mRNA expression analysis of Hif1a
transcripts relative to mRNA encoding β-actin in type H and type L. Data
represent mean ± s.e.m. (n = 3 biological replicates). P values, two-tailed
unpaired t-test. c, d, Maximum intensity projections of HIF1-α (green) and
CD31 (red) immunostaining in 7-week-old (c) and 61-week-old (d) tibiae.
Nuclei, DAPI (blue). HIF1-α-positive endothelium was not detected in
metaphysis (mp) of 7-week-old (c) and 61-week-old tibia (d). e, Maximum
intensity confocal images from the diaphysis of 5-week-old Flk1-GFP (green)
irradiated (900 rads) and control tibiae after HIF1-α (red) immunostaining.
HIF1-α signals (arrowheads) in GFP+ endothelial cells are enhanced after
irradiation. f, qPCR expression analysis of Hif1a relative to transcripts encoding
β-actin in FACS-isolated endothelial cells from bones of irradiated mice and
untreated controls. Data represent mean ± s.e.m. (n = 7 mice from three
independent experiments). P values, two-tailed unpaired t-test.
Extended Data Figure 10 | DFM induction of type H endothelial cells and
osteoprogenitors. a, b, Representative confocal images of CD31 (red, a, b) or
Osterix (green, b) stained tibia sections from aged DFM-treated (right) and
control (left) mice (60-65-weeks-old). Low intensity projection shows only
CD31hi cells. DFM induces CD31hi vessels and Osterix+ osteoprogenitors
(arrowheads). Chondrocytes, ch. c, Tile-scan confocal images of CD31 (red)
and Osterix (green, Osx) from metaphysis region of stained tibia sections from
aged DFM-treated (right) and control (left) mice. Low intensity projection
shows only CD31hi cells. DFM induces CD31hi vessels and Osterix+
growth plate chondrocytes (ch) and outline of compact bone. Arrowheads
indicate Osterix+ cells in secondary ossification centre (soc) and DFM-treated
metaphysis. d, Quantitation of Osterix+ osteoprogenitor cells in DFM-treated
and control parietal bones. Data represent mean ± s.e.m. (n = 6 mice from
two independent experiments). P values, two-tailed unpaired t-test.
e, qPCR analysis of Ibsp, Sp7 and Bglap mRNA expression levels relative to Actb
in the aged DFM or vehicle-treated (Control) parietal bones, as indicated. Data
represent mean ± s.e.m. (n = 6 or 7 mice from two independent experiments).
P values, two-tailed unpaired t-test.
Unexpected link between an antibiotic,
pannexin channels and apoptosis


Ivan K. H. Poon, Yu-Hsin Chiu, Allison J. Armstrong, Jason M. Kinchen, Ignacio J. Juncadella, Douglas A. Bayliss
& Kodi S. Ravichandran


Plasma membrane pannexin 1 channels (PANX1) release nucleotide find-me signals from apoptotic cells to attract phago- 
cytes. Here we show that the quinolone antibiotic trovafloxacin is a novel PANX1 inhibitor, by using a small-molecule 
screen. Although quinolones are widely used to treat bacterial infections, some quinolones have unexplained side effects, 
including deaths among children. PANX1 is a direct target of trovafloxacin at drug concentrations seen in human plasma, 
and its inhibition led to dysregulated fragmentation of apoptotic cells. Genetic loss of PANX1 phenocopied trovafloxacin 
effects, revealing a non-redundant role for pannexin channels in regulating cellular disassembly during apoptosis. Increase 
in drug-resistant bacteria worldwide and the dearth of new antibiotics is a major human health challenge. Comparing 
different quinolone antibiotics suggests that certain structural features may contribute to PANX1 blockade. These data 
identify a novel linkage between an antibiotic, pannexin channels and cellular integrity, and suggest that re-engineering
certain quinolones might help develop newer antibacterials.


Pannexins are four-pass transmembrane channels identified as a new 
family of channels for small molecules (up to ~1 kDa) across the plasma
membrane’”. Among the three vertebrate pannexin family members 
(PANX1, PANX2 and PANX3), PANX1 is the most widely expressed’ and 
is implicated in regulating neutrophil activation’, airway inflammation’, 
human immunodeficiency virus infection*, vasoconstriction®, migraine’ 
and other neurological disorders*’. This broad and diverse range of 
functions may in part arise from pannexin channel-mediated release 
of purines such as ATP into the extracellular space, where purinergic sig- 
nalling can influence multiple physiological processes'*"'. Thus, PANX1 
is an attractive therapeutic target for human diseases and we sought to 
identify small molecules that can modulate PANX1 function. 


Trovafloxacin inhibits PANX1 channel activity 


Caspase-mediated cleavage of PANX1 carboxy terminus during apo- 
ptosis leads to PANX1 channel opening and release of nucleotide find- 
me signals from early apoptotic cells to recruit phagocytes'*"** (Fig. 1a). 
This channel opening also allows the entry of fluorescent dyes, including
TO-PRO-3"*> (Fig. 1a). We optimized TO-PRO-3 uptake by apoptotic
Jurkat cells as a reliable, medium-throughput, flow-cytometry-based
assay for monitoring PANX1 activity. We tested a ‘library of pharmacologically
active compounds’ (LOPAC1280) containing 1,280 small molecules
targeting a diverse range of cellular processes, including currently
marketed drugs, failed candidates, and bioactive molecules with known 
activities. The initial screen revealed three potential PANX1 inhibitors 
that were tested in secondary screens. Among them, trovafloxacin (a 
quinolone-based antibiotic) was identified as a potent inhibitor of TO- 
PRO-3 uptake by apoptotic cells (Fig. 1b). The use of trovafloxacin in 
patients has been linked to serious adverse side effects, including effects 
on the central nervous system, hepatic toxicity and in some cases mor- 
tality, but the molecular target(s) of trovafloxacin in mammalian cells 
is unclear’®"’. Trovafloxacin inhibition of PANX1 was dose-dependent, 
and comparable to the known pannexin inhibitor carbenoxolone (CBX) 
(Fig. 1c). Trovafloxacin also inhibited ATP release from apoptotic cells
(Fig. 1d). Importantly, trovafloxacin did not inhibit caspase 3/7 activation,
or caspase-mediated PANX1 cleavage during apoptosis (Extended
Data Fig. 1a, b), ruling these out as explanations for the reduced dye uptake
and ATP release.

Several additional analyses suggested trovafloxacin could directly 
target PANX1 channel activity. Adding trovafloxacin to cells already 
undergoing apoptosis (that is, with open PANX1 channels) acutely 
blocked TO-PRO-3 uptake (Extended Data Fig. 1c, d). When we mea- 
sured apoptosis-induced plasma membrane PANX1 currents at the 
single-cell level, by whole-cell patch-clamp recordings, trovafloxacin 
rapidly inhibited the inward current (at −50 mV), with minimal effect
on outward current (at +80 mV) (Fig. 1e and Extended Data Fig. 1e).
We have previously shown that the C-terminal tail of PANX1 blocks 
the channel pore, and that adding excess soluble C-terminal tails can 
inhibit ‘open’ PANX1 channels, especially the inward current (analogous
to trovafloxacin)'*. In contrast, CBX blocked both inward and
outward currents’*'*”? (Fig. 1e, f). Trovafloxacin did not inhibit connexin 43
gap junction or PANX2 (Extended Data Fig. 1f-i). Using a
tobacco etch virus (TEV)-protease system to cleave the C-terminal tail 
of recombinant PANX1 and induce channel activity (independent of 
apoptosis)'*™*, trovafloxacin again potently blocked open PANX1 chan- 
nels (Fig. 1g). To test direct channel blocking, we recorded TEV-cleaved 
PANX1 single-channel activity in excised inside-out patch-clamp by 
adding trovafloxacin to the patch; this led to an increase in the time 
spent in the closed state, with open probability (NPo) of ~0.85 in control
conditions reduced to ~0.15 with trovafloxacin (Fig. 1h). The half
maximal inhibitory concentration (IC50) of trovafloxacin was ~4 μM
for the PANX1 inward current (Fig. 1i), similar to concentrations
normally achieved in human plasma (2-10 μM)”°”". These data indicated
that mammalian PANX1 channels could be a direct target of the antibiotic
trovafloxacin.
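To see why an IC50 of about 4 μM matters at plasma concentrations of 2-10 μM, the expected block can be estimated with a simple Hill-type inhibition curve. This is a sketch under an assumption the text does not state, a Hill coefficient of 1; the function name and defaults are illustrative only.

```python
def fraction_inhibited(conc_um, ic50_um=4.0, hill=1.0):
    """Fraction of current blocked at a given drug concentration (Hill equation).

    ic50_um defaults to the ~4 uM reported for the PANX1 inward current;
    hill=1.0 is an assumed coefficient, not a value taken from the study.
    """
    return conc_um ** hill / (conc_um ** hill + ic50_um ** hill)


# Reported plasma range (2-10 uM) plus the IC50 itself.
for conc in (2.0, 4.0, 10.0):
    print(conc, round(fraction_inhibited(conc), 3))
```

Under this assumption, roughly one third of the inward current would be blocked at 2 μM and about 70% at 10 μM, so the reported plasma range spans the steep part of the curve.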


Trovafloxacin promotes fragmentation of apoptotic cells 


Next, we investigated trovafloxacin effects on apoptotic cells via micros- 
copy and made several surprising observations. Apart from reducing 
Figure 1 | Trovafloxacin inhibits pannexin 1 activity during apoptosis.
a, Schematic of PANX1 with ATP release and TO-PRO-3 uptake by
apoptotic cells after caspase-mediated channel opening. b, Subset of
LOPAC1280 library screen in modulating TO-PRO-3 uptake (median
fluorescence intensity, MFI; average of 1,200-1,800), with inhibition by
trovafloxacin indicated. c, Dose-dependent inhibition of TO-PRO-3 uptake
by trovafloxacin (or positive control CBX), levofloxacin and ciprofloxacin
(n = 3). d, Inhibition of ATP release from apoptotic cells by
trovafloxacin (n = 3). e, f, Jurkat cells after 2 h with
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TO-PRO-3 uptake by apoptotic cells, trovafloxacin also induced the 
formation of smaller particles; these ‘fragments of apoptotic cells’ were 
annexin V+, indicating phosphatidylserine exposure, and resembled
apoptotic bodies that arise after cell disassembly during apoptosis
(Fig. 2a, b). To quantify these apoptotic bodies, we designed a flow- 
cytometry-based assay (Extended Data Fig. 2a, b) to analyse five para- 
meters simultaneously: cell size (forward scatter, FSC), cellular complexity 
(side scatter, SSC), annexin V staining (indicating apoptosis), TO-PRO-3 
uptake (PANX1 opening), and 7-aminoactinomycin D (7-AAD) stain- 
ing (loss of plasma membrane integrity). We also performed Image- 
Stream analysis, which combines flow cytometry and image acquisition 
to confirm the categorization of cells and apoptotic bodies (Extended 
Data Fig. 2c). The apoptotic bodies were annexin V^intermediate (compared
to annexin V^high apoptotic cells), with smaller size and reduced
complexity (Extended Data Fig. 2). Trovafloxacin promoted the forma- 
tion of apoptotic bodies in a concentration-dependent manner (Fig. 2c, d). 
Importantly, without an apoptotic stimulus, trovafloxacin did not induce 
apoptosis or apoptotic bodies (Extended Data Fig. 1a), indicating that 
trovafloxacin probably modulates steps during apoptosis that promote 
cellular disassembly. Although trovafloxacin itself did not induce apo- 
ptosis, because the human body turns over more than 200 billion cells 
daily via apoptosis as part of homeostasis”, trovafloxacin could affect 
these apoptotic steps. 


PANX1 regulates the disassembly of apoptotic cells

Although some studies have addressed apoptotic cell disassembly, the 
mechanisms that control the formation of apoptotic bodies are not 
well understood. Several lines of evidence suggested a key role
for PANX1 in regulating apoptotic cell integrity/disassembly. First, 
blocking pannexin channels in apoptotic Jurkat cells with CBX or pro- 
benecid (another pannexin inhibitor’’) promoted formation of apoptotic 
bodies (Fig. 2e, f). Second, this effect was not dependent on the mode 
of apoptosis induction, as blocking PANX1 during Fas- or ultraviolet- 
mediated apoptosis promoted formation of apoptotic bodies (Fig. 2f 
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and Extended Data Fig. 3a). Third, using loss of cellular complexity 
during apoptosis as an indication of fragmentation of larger apoptotic 
cells (SSC^high/higher cellular complexity) into smaller apoptotic bodies
(SSC^low/lower cellular complexity), blocking PANX1 reduced the number
of SSC^high particles (Fig. 2g). Treating wild-type primary thymocytes
undergoing apoptosis ex vivo with trovafloxacin, CBX or probenecid 
also led to more apoptotic bodies with reduced cellular complexity 
(Fig. 3a, b). Fourth, the enhanced apoptotic bodies correlated inversely 
with TO-PRO-3 uptake (Extended Data Fig. 3b), with progressively 
more PANX1 channel blocking leading to greater formation of apop- 
totic bodies. Fifth, in a genetic approach, overexpressing a dominant 
negative form of PANX1 (PANX1 DN) with a mutant caspase cleavage 
site that can interfere with endogenous PANX1 function’’ (Extended 
Data Fig. 4a) resulted in increased formation of apoptotic bodies (Fig. 2h). 
Conversely, overexpressing wild-type PANX1 led to fewer apoptotic 
bodies. This ‘gain of function’ due to wild-type PANX1 expression was 
still dependent on PANX1 channel activity, with reversal by PANX1 
blockers (Fig. 2i). Lastly, blocking PANX1 channel functions did not 
affect some known apoptotic features such as DNA fragmentation 
(Extended Data Fig. 4b) or phosphatidylserine exposure (Fig. 2b). Col- 
lectively, these data identify PANX1 as a regulator of cellular disas- 
sembly during apoptosis. 
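
The legend of Fig. 2f defines an apoptotic bodies formation index as the ratio of apoptotic bodies to annexin V+ apoptotic cells. A minimal sketch of that ratio is given here; the function name and the event counts are hypothetical and serve only to illustrate the arithmetic.

```python
def formation_index(n_apoptotic_bodies: int, n_annexin_pos_cells: int) -> float:
    """Apoptotic bodies formation index: apoptotic bodies per annexin V+ apoptotic cell."""
    if n_apoptotic_bodies < 0:
        raise ValueError("number of apoptotic bodies cannot be negative")
    if n_annexin_pos_cells <= 0:
        raise ValueError("need at least one annexin V+ apoptotic cell")
    return n_apoptotic_bodies / n_annexin_pos_cells


if __name__ == "__main__":
    # Hypothetical gated event counts, not data from the paper.
    vehicle = formation_index(n_apoptotic_bodies=300, n_annexin_pos_cells=1000)
    blocker = formation_index(n_apoptotic_bodies=900, n_annexin_pos_cells=1000)
    print(f"vehicle index = {vehicle:.2f}, PANX1 blocker index = {blocker:.2f}")
```

Normalizing to the number of apoptotic cells keeps the index comparable between samples in which different fractions of cells have entered apoptosis.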


Trovafloxacin and PANX1 regulate apoptotic cell 
disassembly in vivo 

To address how trovafloxacin and PANX1 might affect apoptosis in a 
whole animal, we used intraperitoneal dexamethasone injection to induce 
synchronous apoptosis of a large fraction of mouse thymocytes in vivo'*”* 
(Fig. 3c). Administering trovafloxacin with dexamethasone promoted 
the fragmentation of the apoptotic thymocytes (Fig. 3d, with fewer
SSC^high particles; see Extended Data Figs 5 and 6 for gating strategy).
However, in these experiments we could not track thymic trovaflox- 
acin concentrations, and trovafloxacin in vivo could also have had other 
effects beyond PANX1. Therefore, we generated mice carrying deletion 
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Figure 2 | Trovafloxacin-mediated inhibition of PANX1 promotes 
formation of smaller apoptotic bodies. a, Schematic for apoptosis 
progression and generation of apoptotic bodies, derived from multi-parameter 
time-lapse analyses of Jurkat cells. b, Images of vehicle- or trovafloxacin-treated 
apoptotic cells stained with annexin V or TO-PRO-3. Note the annexin
V-positive, TO-PRO-3-negative apoptotic bodies (~0.9-5 µm) after trovafloxacin
treatment. c, Trovafloxacin increases particles with reduced size (FSC) and 
complexity (SSC). d, Generation of apoptotic bodies is enhanced by 
trovafloxacin, but not levofloxacin and ciprofloxacin (n = 3). e, Left, 


of Panx1, using embryonic stem cells engineered to disrupt the Panx1
gene (Extended Data Fig. 7). In the thymus of Panx1−/− and Panx1+/−
mice treated with dexamethasone, the fragmentation of apoptotic cells
was enhanced (Fig. 3e), essentially phenocopying the effect of
trovafloxacin treatment in wild-type mice (Fig. 3d). The apoptotic particles
from Panx1−/− and Panx1+/− mice also showed reduced TO-PRO-3
uptake (Fig. 3e). To rule out that loss of PANX1 in other tissues
contributed to the observed thymic phenotypes in the global knockout mice,
we specifically deleted Panx1 in thymocytes by crossing the Panx1fl/fl
mice with Lck-Cre mice, where Cre is expressed in the T-cell lineage
(Extended Data Fig. 7). Similar apoptotic fragmentation and loss of
TO-PRO-3 uptake were observed when PANX1 was deleted specifically
in thymocytes (Fig. 3f), indicating a cell-autonomous effect. Ex
vivo, apoptotic thymocytes from PANX1 global knockout mice or the
Lck-Cre/Panx1fl/fl mice also showed enhanced formation of apoptotic
bodies, decreased TO-PRO-3 uptake and reduced cellular complexity
(Fig. 3g-i). Collectively, these data indicate that PANX1 critically influ- 
ences cellular integrity of dying cells, with loss of PANX1 function either 
genetically or via trovafloxacin treatment promoting disassembly of 
apoptotic cells. 

One possible mechanism for this increased cellular fragmentation 
is the lack of ATP that would normally be released by apoptotic cells, 
and the various autocrine and paracrine signalling via P2 purinergic 
receptors. However, adding exogenous ATP to apoptotic cells
when PANX1 channel activity was blocked did not inhibit apoptotic 
bodies formation (Extended Data Fig. 3c, d). Conversely, adding recom- 
binant apyrase (to hydrolyse ATP) during induction of apoptosis did 
not promote the formation of apoptotic bodies (Extended Data Fig. 3e). 
Furthermore, treatment with suramin, a broad inhibitor of P2 family 
G-protein-coupled receptors, did not enhance formation of apoptotic 
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representative 4h time-lapse images monitoring apoptotic cell morphology 
(454 cells for untreated and 355 cells for CBX-treated, n = 3). Right, percentage 
of cell fragmentation (≥2 fragments). f, Apoptotic bodies formation index
(apoptotic bodies/annexin V+ apoptotic cells) is enhanced by PANX1 blockers
CBX and probenecid (n = 3). g, Left, size (FSC) and complexity (SSC) of
particles from apoptotic cells with or without CBX. Right, percentage of SSC^high
particles. h, i, Formation of apoptotic bodies is increased in Jurkat cells 
expressing PANX1 DN mutant, but decreased in PANX1 WT transfected cells 
(n = 3). The PANX1 WT effect is reversed by CBX. Error bars represent s.e.m. 


bodies (Extended Data Fig. 3f). Thus, the extracellular ATP levels and 
P2 purinergic receptor signalling are probably not directly involved in 
regulating apoptotic cell integrity. 


PANX1 regulates formation of string-like apoptopodia


We next used time-lapse microscopy to monitor cell morphology and 
TO-PRO-3 uptake. In our experimental conditions with Jurkat cells 
induced to undergo Fas-mediated apoptosis, most cells begin to show 
rounding and membrane blebs around 60-90 min, and continue to 
bleb for another 30-60 min, a time period referred to here as ‘dynamic
blebbing’. Remarkably, very few of these blebs detach from the apop- 
totic cell, and after cessation of the dynamic blebbing, these blebs remain 
attached to the dying cell (Fig. 4a, b and Supplementary Video 1). When 
we added TO-PRO-3 into the medium and tracked its uptake as a 
measure of PANX1 activity, TO-PRO-3 staining was detected before 
cell rounding and the onset of blebbing (Fig. 4c and Extended Data 
Fig. 4c). This indicates that PANX1 channels are activated/opened before 
detection of membrane blebbing. When we compared cells with normal 
or impaired PANX1 function (treated with trovafloxacin or CBX, or 
expressing PANX1 DN mutant), a previously unappreciated step dur- 
ing dynamic membrane blebbing regulated by pannexin channels was 
revealed. Specifically, apoptotic cells with impaired PANX1 function 
produced long string-like structures, which we preliminarily denote 
‘apoptopodia’, with blebs of different sizes attached to the ends (Fig. 4a, 
b, d and Supplementary Videos 2 and 3). These apoptopodia were annexin
V+, indicating phosphatidylserine exposure (Fig. 4e). Whereas few
apoptopodia were seen in apoptotic cells with normal PANX1 function, 
their frequency was substantially increased by blocking PANX1 (Fig. 4f). 
Blocking PANX1 function in live cells (either pharmacologically or 
using the PANX1 DN mutant) did not cause apoptopodia (data not 
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Figure 3 | Pannexin 1 regulates disassembly of apoptotic thymocytes. 

a, b, Increase in formation of apoptotic bodies (a), or decrease in SSC^high
particles (b) from dexamethasone (Dex)-treated thymocytes ex vivo by
trovafloxacin, CBX, and probenecid (n = 3). c, Schematic for testing
trovafloxacin in thymic apoptosis in vivo. d, Reduction in SSC^high apoptotic
bodies in thymi of wild-type mice treated with Dex + trovafloxacin (n = 6).
e, f, Complexity and TO-PRO-3 uptake (fold change relative to unstained) in
thymi of indicated mice injected intraperitoneally with Dex (n = 10 for
Panx1+/+, n = 8 for Panx1+/−, n = 7 for Panx1−/−, n = 8 for Panx1fl/fl and
Lck-Cre/Panx1fl/fl). g, h, Formation of apoptotic bodies (upper) or TO-PRO-3


shown), indicating that pannexin channels regulate the formation of 
these membrane protrusions after apoptosis induction. These apop- 
topodia were also observed in apoptotic thymocytes from PANX1 
knockout mice, or wild-type thymocytes treated with pannexin inhi- 
bitors (Fig. 3j). Fas-, dexamethasone- or ultraviolet-induced apoptosis 
led to apoptopodia formation when PANX1 function was impaired 
(data not shown). 

We next asked how the apoptopodia formation and membrane bleb- 
bing were linked. PANX1-mediated TO-PRO-3 uptake began before 
the onset of blebbing (Fig. 4c), but the apoptopodia were detectable 
only after the onset of blebbing (Fig. 4b, g). This indicates that PANX1 
activity is probably continuously required during apoptosis progres- 
sion to downregulate the formation of apoptopodia. Interestingly, when 
blebbing was blocked by inhibiting actomyosin contraction or the Rho- 
dependent kinase ROCK (which are needed for apoptotic blebbing”***"), 
apoptopodia formation was unaffected, with noticeable absence of blebs 
at the end of these strings (Extended Data Fig. 8a-c). Importantly, 
inhibiting blebbing reduced the formation of apoptotic bodies in Jurkat 
cells with PANX1 function disrupted by pharmacological block, or in 
uptake (lower) ex vivo in Dex-treated thymocytes from Panx1−/− or
Lck-Cre/Panx1fl/fl mice. i, Reduction in SSC^high subcellular apoptotic particles in
thymocytes from Panx1−/− and Lck-Cre/Panx1fl/fl mice. (g-i, n = 6 for
Panx1+/+, n = 8 for Panx1+/−, n = 6 for Panx1−/−, n = 4 for Panx1fl/fl, n = 3
for Lck-Cre/Panx1fl/fl.) j, Time-lapse images monitoring apoptotic cell
morphology of thymocytes. Arrows, apoptopodia. Error bars represent s.e.m.
Data in a, b and j are representative of at least two independent experiments.
See Extended Data Figs 5 and 6 for gating strategy. *P < 0.05, **P < 0.01,
***P < 0.001, unpaired Student’s two-tailed t-test.


PANX1-deficient thymocytes (Extended Data Fig. 8d-f). These data 
indicate that the apoptopodia that arise owing to PANX1 inhibition are 
independent of blebbing, but that the annexin V~ vesicular structures 
seen at the end of apoptopodia are dependent on blebbing and contri- 
bute to apoptotic bodies formation. LR73 fibroblasts induced to undergo 
apoptosis also showed the formation of apoptopodia and apoptotic bodies 
when PANX1 function was impaired (Extended Data Fig. 9). Thus, 
trovafloxacin-mediated enhancement of apoptotic bodies formation 
helped uncover a new step during the early stages of apoptosis; spe- 
cifically, pannexin channel function is required to restrain detachment 
of blebs from the apoptotic cells, and in turn, regulate the nature of 
apoptotic cell disassembly (Extended Data Fig. 10). 


PANX1 inhibition varies among different quinolones 

Although quinolone antibiotics are effective at killing bacteria by targeting
their topoisomerases, and are used worldwide to treat various bacterial
infections, some quinolones have serious side effects (incidence
up to 5%) in the liver, skin, tendon, gastrointestinal tract, central nervous
system and cardiovascular system that are not readily explained.
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Figure 4 | Formation of string-like apoptopodia after membrane blebbing 
correlates with formation of apoptotic bodies. a, Schematic of apoptotic 
Jurkat cells with normal or impaired PANX1, based on time-lapse microscopy. 
b, Time-lapse images monitoring progression of apoptotic cell morphology. 
c, PANX1-mediated TO-PRO-3 entry occurs before membrane blebbing as 
determined by quantifying TO-PRO-3 uptake by apoptotic cells (n = 15) in 
time-lapse imaging (normalized to first sign of cell rounding). TO-PRO-3 
uptake was undetectable in live cells (n = 11). d, Apoptotic cells treated
with trovafloxacin or expressing the PANX1 DN mutant show formation
of apoptopodia. e, Apoptopodia have membranes with exposed
phosphatidylserine and apoptotic blebs at the end of the protrusion.


Surprisingly, in contrast to trovafloxacin, two other structurally related 
quinolone antibiotics ciprofloxacin and levofloxacin did not block PANX1- 
dependent dye uptake (Fig. 1c), despite all three being equally effective 
in inhibiting bacterial growth (Extended Data Fig. 1j). Comparing the 
molecular structure of trovafloxacin with ciprofloxacin and levoflox- 
acin suggested that certain features of trovafloxacin, such as the fluo- 
rinated ring at position N1 (Fig. 4h), might contribute to its effect on 
PANX1 (Figs 1c and 2d). We next asked whether other quinolones with 
a fluorinated ring at position N1 can inhibit PANX1, and tested diflox- 
acin and tosufloxacin (Fig. 4h) that are currently available commercially 


f, Percentage of apoptotic cells with apoptopodia during 4h of time-lapse 
imaging. g, Apoptopodia are barely detectable before onset of membrane 
blebbing in cells with impaired PANX1 function, but detected after initiation of 
blebbing. (f, g, 429 and 302 cells for untreated and CBX-treated, respectively, 
n = 3.) h, Chemical structures of quinolones used, with the N1 fluorinated ring 
(blue) indicated in quinolones that inhibit PANX1 (trovafloxacin, difloxacin 
and tosufloxacin), compared to quinolones with no apparent PANX1 inhibition 
(ciprofloxacin and levofloxacin) (orange). i, TO-PRO-3 uptake, ATP release, 
and formation of apoptotic bodies from apoptotic Jurkat cells treated with 
trovafloxacin, difloxacin or tosufloxacin (n = 3). Error bars represent s.e.m. Arrows,
apoptopodia. Data are representative of at least two independent experiments. 


in some Asian countries. Although less potent than trovafloxacin, both 
difloxacin and tosufloxacin partially inhibited ATP release and TO- 
PRO-3 uptake by apoptotic cells, and promoted the formation of apo- 
ptotic bodies (Fig. 4i). These data do not exclude other molecular features 
of trovafloxacin that may also contribute to PANX1 inhibition, such as 
the C7 position of the quinolone (Fig. 4h). 


Discussion 


The data presented here suggest an unexpected but intriguing link 
between the antibiotic trovafloxacin, eukaryotic pannexin channels, 
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and a specific step in progression through apoptosis. Specifically, they 
provide three key insights. First, trovafloxacin, originally marketed by 
Pfizer, showed tremendous promise, but was linked to severe toxicity 
and unexplained deaths among children in a trial in Nigeria, and was 
discontinued’*”*. Because billions of cells are turned over daily via caspase- 
mediated apoptosis in the human body as part of normal life, and tro- 
vafloxacin can directly inhibit the mammalian PANX1 channels at doses 
normally achieved in humans, this could, in part, provide a mechanism 
for trovafloxacin toxicity. Second, our data identify a new role for 
pannexin channels in maintaining cellular integrity during apoptosis. 
Fragmentation of apoptotic cells into smaller apoptotic bodies has been 
seen in vitro and in vivo, but the mechanism(s) regulating this
disassembly process is not well understood. The trovafloxacin-mediated
blockage of PANX1 uncovered a previously unappreciated step during 
apoptosis, and revealed an essential and non-redundant role for pan- 
nexin channels in the ordered disassembly of apoptotic cells. Third, the 
dearth of new antibiotics in the pipeline despite the steady increase in
drug-resistant bacteria worldwide is a significant health threat.
Interestingly, quinolone antibiotics such as ciprofloxacin and levoflox- 
acin do not share the inhibitory effect of trovafloxacin towards the 
mammalian PANX1, and certain features in trovafloxacin might in 
part explain its toxicity. Further studies on the interaction of quino- 
lones with PANX1 and the associated apoptosis pathways may provide 
clues to the underlying reasons for the idiosyncratic and irreversible 
liver toxicity observed with trovafloxacin. Unlocking such clues could 
re-invigorate industrial interest in the quinolone class for the production
of safer and more potent antibacterials.


METHODS SUMMARY 


For drug screening targeting pannexin channels, Jurkat cells were induced to undergo
apoptosis (4 h) by anti-Fas (250 ng ml−1) in the presence of 10 µM of each
compound from the LOPAC1280 library (Sigma-Aldrich). Cells were then stained with
TO-PRO-3 and analysed by flow cytometry. Chimeric mice were generated using
Panx1-targeted embryonic stem cells (from KOMP repository) and crossed to
β-actin/Flp mice (JaxMice) to delete the neomycin cassette, and crossed with EIIa-Cre
mice (JaxMice) or Lck-Cre mice (JaxMice) to delete PANX1 globally or specifically
in thymocytes, respectively. For determining apoptotic cell fragmentation by flow
cytometry, Jurkat cells were stained with annexin V-FITC, 7-AAD and TO-PRO-3,
whereas thymocytes from in vivo experiments were stained similarly with annexin
V-FITC, 7-AAD, TO-PRO-3, CD8a conjugated with phycoerythrin and CD4
conjugated with phycoerythrin-Cy7, with gating as per Extended Data Figs 2, 5 and 6.
For time-lapse microscopy, cells seeded onto Lab-Tek II Chambered Coverglass
(Nunc) were imaged live using a Zeiss microscope in a 37 °C/5% CO2 atmosphere.
Whole-cell and inside-out patch recordings were made as described (see Methods).
A ramp voltage was performed using pCLAMP software and a Digidata 1322A
digitizer (Molecular Devices), with data analysed by pCLAMP software.


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
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METHODS 

Reagents. Library of Pharmacologically Active Compounds (LOPAC1280; http://
www.sigmaaldrich.com/chemistry/drug-discovery/validation-libraries/lopac1280- 
navigator.html), trovafloxacin, ciprofloxacin, levofloxacin, difloxacin, tosufloxacin, 
carbenoxolone, probenecid, Y-27632, blebbistatin, cytochalasin D, purified nucle- 
otides, suramin and dexamethasone were obtained from Sigma-Aldrich. 7-AAD 
and TO-PRO-3 were purchased from Invitrogen. Annexin V-FITC, CD8a-PE 
(clone 53-6.7), CD4-PE-Cy7 (clone RM4-5) and anti-mouse CD16/CD32 (clone 
93) were obtained from eBioscience. Other reagents were obtained as follows: anti-Fas
(clone CH11, Millipore), z-VAD-FMK (Enzo Life Sciences), Q-VD-OPH (SM
Biochemicals) and recombinant apyrase (New England Biolabs). 

Induction of apoptosis. Jurkat cells in RPMI with 1% BSA were treated with
250 ng ml−1 anti-Fas (clone CH11) or 150 mJ cm−2 ultraviolet C irradiation
(Stratalinker). Primary thymocytes (collected from 5- to 7-week-old C57BL/6 mice,
male and female) were treated with 50 µM dexamethasone for 5 h. LR73 cells were
treated with 150 mJ cm−2 ultraviolet C irradiation for 6 h. All treatments were
incubated for indicated times at 37 °C, 5% CO2, unless noted otherwise.

Drug screening for regulators of PANX1 function. Jurkat cells (10° cells ml! 
in DMEM + 0.5% BSA) were induced to undergo apoptosis by anti-Fas treatment 
(250 ng ml−1) in the presence of 10 µM of compounds in the LOPAC1280
(Sigma-Aldrich) for 4 h at 37 °C, 5% CO2. Cells were then stained with TO-PRO-3 (0.67 µM)
for 10 min at room temperature and immediately placed on ice before analysis on a
BD FACSCanto flow cytometer. The resultant flow cytometric data were analysed 
by FlowJo software (Tree Star). 

Determining cell viability by flow cytometry and ImageStream. Samples were 
stained with annexin V-FITC, 7-AAD and TO-PRO-3 in annexin V binding buffer 
for 10 min at room temperature and immediately placed on ice before analysis on a
BD FACSCanto flow cytometer, with the resultant flow cytometry data analysed by 
FlowJo software. Samples were also prepared the same way before analysis on an 
ImageStreamX Mark II (Amnis), with the resultant data analysed by IDEAS soft- 
ware (Amnis). 

For TO-PRO-3 uptake assay following acute drug treatment, Jurkat cells in
RPMI/1% BSA were treated with 500 ng ml−1 anti-Fas (clone CH11) for 4 h at
37 °C, 5% CO2. Following induction of apoptosis, cells were treated with
pan-caspase inhibitor Q-VD-OPH (50 µM) and the indicated concentrations of
compounds for 20 min at 37 °C, 5% CO2. Cells were subsequently stained with annexin
V-FITC, 7-AAD and TO-PRO-3 and analysed on a FACSCanto flow cytometer. 
The resultant flow cytometry data were analysed by FlowJo software. 
Generation of pannexin 1 knockout mice. Panx1-targeted embryonic stem cells
(EPD0309-3-B01) were obtained from the Knockout Mouse Project (KOMP)
Repository. After blastocyst injections (performed by the transgenic core facility at the
University of Virginia), chimaeras were bred with C57BL/6J mice (JaxMice). The
resultant offspring were crossed with β-actin/Flp mice (JaxMice) to delete the
neomycin cassette, thereby generating mice carrying a floxed Panx1 exon 3 allele (Panx1fl/fl)
(Extended Data Fig. 7). To generate Panx1 global knockout mice, Panx1fl/fl mice
were crossed with EIIa-Cre mice (JaxMice) expressing Cre from the two-cell stage
of embryonic development to delete Panx1 exon 3 in all tissues. Resulting mice
were crossed with C57BL/6J mice to remove Flp and Cre from the background. To
generate mice that carry deletion of Panx1 specifically in thymocytes, Panx1fl/fl
mice were crossed with Lck-Cre mice (JaxMice) expressing Cre under the Lck
proximal promoter, which mediates deletion of Panx1 exon 3 from the
double-negative stage of thymocyte development.
Microscopy. Live imaging was performed on a Zeiss microscope using a ×40 or
×60 oil immersion objective in a 37 °C/5% CO2 atmosphere. Jurkat cells at 5 × 10°
cells ml−1 in RPMI + 1% BSA (600 µl) were seeded onto Lab-Tek II Chambered
Coverglass (Nunc) by two successive centrifugations at 20g for 1 min before imag- 
ing. LR73 fibroblasts were cultured overnight on Chambered Coverglass before 
imaging. In certain experiments, TO-PRO-3 and annexin V-FITC were also pre- 
sent during the imaging process. 
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Bacterial growth assay. Overnight Escherichia coli culture was diluted 100-fold 
in LB medium before incubation with quinolones for 6h. Bacterial growth was 
monitored by the absorbance at 600 nm using a FlexStation 3 plate reader, with 
the resultant data analysed using SoftMaxPro 5.4 software. 

Nucleotide measurement. ATP in apoptotic cell supernatants was measured using 
a luciferase/luciferin assay (CellTiter-Glo; Promega) according to manufacturer's 
instructions. 

Caspase activation. Caspase 3/7 activity assays were performed with the Caspase- 
Glo 3/7 (Promega) reagents, in accordance with the manufacturer’s instructions. 
Immunoblotting. Samples were analysed by SDS-PAGE and immunoblotted 
using the following dilutions: anti-GFP (1:1,000; Santa Cruz), anti-ERK2 (1:3,000; 
Santa Cruz), anti-Caspase 3 (1:2,500; Santa Cruz) and affinity purified rabbit
anti-mouse PANX1 (0.2 µg ml−1).

Patch-clamp analysis. Electrophysiological recordings were made at room
temperature using an Axopatch 200B amplifier (Molecular Devices). Whole-cell
recordings were carried out in Jurkat cells or transiently transfected HEK293T cells with
borosilicate glass patch pipettes (3-5 MΩ). Bath solution contained 140 mM NaCl,
3 mM KCl, 2 mM MgCl2, 2 mM CaCl2, 10 mM HEPES, and 10 mM glucose (pH 7.3).
Pipette solution was composed of 30 mM tetraethylammonium chloride, 100 mM
CsMeSO4, 4 mM NaCl, 1 mM MgCl2, 0.5 mM CaCl2, 10 mM HEPES, 10 mM EGTA,
3 mM ATP-Mg, and 0.3 mM GTP-Tris (pH 7.3). A ramp voltage command was
applied using pCLAMP software and a Digidata 1322A digitizer (Molecular Devices).
Data were analysed using pCLAMP software. Results were presented as means
± s.e.m. The dose-response curve was fitted and IC50 was determined using Prism
5 software. Inside-out patch recordings were obtained using patch pipettes
(2-4 MΩ) filled with the bath solution (described above) in a perfusate containing
150 mM CsCl, 5 mM EGTA, 1 mM MgCl2, 10 mM HEPES (pH 7.3). Currents
were evoked from a holding potential of −50 mV. All-point histograms and NPo
analysis were performed using pCLAMP software. Connexin 43 currents were
measured in divalent-free bath solution.
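
The NPo values reported for the inside-out patches can be read as the time-averaged number of open channels in the patch. The sketch below shows that calculation on an idealized, evenly sampled record; it is an illustrative assumption about the arithmetic and not a description of how pCLAMP computes NPo internally. The function name and the example trace are hypothetical.

```python
from typing import Sequence


def npo(open_levels: Sequence[int]) -> float:
    """NPo from an idealized single-channel record.

    Each element is the number of channels open during one sample interval
    (0 = all closed). With evenly spaced samples, NPo is the mean open level.
    """
    if len(open_levels) == 0:
        raise ValueError("record is empty")
    if any(level < 0 for level in open_levels):
        raise ValueError("open levels cannot be negative")
    return sum(open_levels) / len(open_levels)


if __name__ == "__main__":
    # Hypothetical idealized traces, not recordings from the paper.
    control = [1, 1, 1, 0, 1, 1, 1, 1, 0, 1]
    with_drug = [0, 0, 1, 0, 0, 0, 0, 1, 0, 0]
    print(f"control NPo = {npo(control):.2f}, with blocker NPo = {npo(with_drug):.2f}")
```

For a patch containing a single channel, NPo reduces to the open probability, so a shift towards longer closed times lowers it directly.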

DNA fragmentation assay. DNA laddering during apoptosis was characterized 
by agarose gel electrophoresis”. In brief, 60,000 Jurkat cells were induced to undergo 
apoptosis by anti-Fas treatment for 2 h and lysed in TES lysis buffer containing RNase 
for 2 h at 37 °C. Cell lysates were subsequently treated with proteinase K for 18 h at
50°C and DNA fragmentation was visualized by agarose gel electrophoresis. 
Quantitative PCR. cDNA was synthesized from 50ng of RNA isolated from 
primary thymocytes (RNeasy, Qiagen) using Superscript III (Invitrogen). Quan- 
titative PCR was performed on the ABI StepOnePlus Real-time PCR instrument 
with TaqMan probes (Applied Biosystems). Levels of mouse Panx1 mRNA are nor- 
malized to Gapdh. TaqMan probes used were: mouse Panx1 (Mm00450900_m1) 
and mouse Gapdh (4352339E-1207039). 
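
The text states only that Panx1 mRNA levels are normalized to Gapdh. One common way to do this is the comparative Ct method, sketched below under the assumption of roughly 100% amplification efficiency; whether the authors used exactly this calculation is not stated. All function names and Ct values are hypothetical.

```python
def relative_expression(ct_target: float, ct_reference: float) -> float:
    """Target abundance relative to the reference gene (2^-dCt).

    Assumes the template doubles every cycle, which is an idealization.
    """
    delta_ct = ct_target - ct_reference
    return 2.0 ** (-delta_ct)


def fold_change(sample: float, control: float) -> float:
    """Ratio of normalized expression in a sample versus a control."""
    if control <= 0:
        raise ValueError("control expression must be positive")
    return sample / control


if __name__ == "__main__":
    # Hypothetical Ct values for Panx1 (target) and Gapdh (reference).
    wild_type = relative_expression(ct_target=26.0, ct_reference=18.0)
    knockout = relative_expression(ct_target=31.0, ct_reference=18.0)
    print(f"Panx1 in knockout relative to wild type: {fold_change(knockout, wild_type):.5f}")
```

Normalizing each sample to its own Gapdh signal corrects for differences in the amount of input cDNA before genotypes are compared.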

In vivo model of apoptosis. 5-week-old C57BL/6 mice (male and female) were
injected intraperitoneally with 12.5 mg kg−1 dexamethasone. Trovafloxacin (60 mg kg−1;
intraperitoneally) was administered 10 min prior to dex injection (12.5 mg kg−1;
intraperitoneally), and trovafloxacin was also given at 2 and 4h and the analysis 
performed at 6h. Thymocytes were collected 6h post dexamethasone injection, 
stained with annexin V-FITC, 7-AAD, TO-PRO-3, CD8a-PE and CD4-PE-Cy7, 
and analysed on a BD FACSCanto flow cytometer. The University of Virginia 
Animal Care and Use Committee approved all animal experiments. 

Statistical analyses. Data are presented as means + s.e.m. Statistical significance for 
comparisons was determined by unpaired Student’s two-tailed t-test. A P value less 
than 0.05 was considered statistically significant. *P < 0.05, **P < 0.01, ***P < 0.001. 
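
The summary statistics described above can be sketched with the Python standard library alone. The code computes the mean, the s.e.m., and the equal-variance (Student's) t statistic for two unpaired groups, and maps a P value to the star notation used in the figures. The function names and example data are hypothetical; converting the t statistic to a P value needs the t distribution, which is not implemented here.

```python
import math
from statistics import mean, stdev
from typing import Sequence, Tuple


def sem(values: Sequence[float]) -> float:
    """Standard error of the mean (sample standard deviation / sqrt(n))."""
    return stdev(values) / math.sqrt(len(values))


def student_t(a: Sequence[float], b: Sequence[float]) -> Tuple[float, int]:
    """Unpaired, equal-variance Student's t statistic and its degrees of freedom."""
    na, nb = len(a), len(b)
    pooled_var = ((na - 1) * stdev(a) ** 2 + (nb - 1) * stdev(b) ** 2) / (na + nb - 2)
    t = (mean(a) - mean(b)) / math.sqrt(pooled_var * (1.0 / na + 1.0 / nb))
    return t, na + nb - 2


def stars(p_value: float) -> str:
    """Star notation: * P < 0.05, ** P < 0.01, *** P < 0.001."""
    if p_value < 0.001:
        return "***"
    if p_value < 0.01:
        return "**"
    if p_value < 0.05:
        return "*"
    return ""


if __name__ == "__main__":
    # Hypothetical measurements from three independent experiments per group.
    control, treated = [1.0, 2.0, 3.0], [4.0, 5.0, 6.0]
    t, df = student_t(control, treated)
    print(f"control = {mean(control):.2f} +/- {sem(control):.2f} (s.e.m.), t = {t:.3f}, d.f. = {df}")
```

If SciPy is available, `scipy.stats.ttest_ind(a, b)` performs the same equal-variance, two-sided test and also returns the P value.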
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Extended Data Figure 1 | Trovafloxacin does not block caspase activation or 
inhibit connexin 43 (Cx43) or pannexin 2 (Panx2) membrane currents. 

a, Caspase 3/7 activation in Jurkat cells undergoing apoptosis is not altered by
treatment with trovafloxacin (40 µM) (n = 3). b, Proteolytic cleavage of
PANX1-GFP during apoptosis is not inhibited by trovafloxacin (40 µM) or
CBX (500 µM) treatment. c, Schematic diagram for the acute treatment of
apoptotic cells with trovafloxacin or CBX. d, Acute trovafloxacin treatment
inhibits TO-PRO-3 uptake by apoptotic Jurkat cells. Left, histograms showing
TO-PRO-3 uptake by viable cells, apoptotic cells, or apoptotic cells treated with
trovafloxacin or CBX (500 µM) post induction of apoptosis and analysed by
flow cytometry. Right, uptake of TO-PRO-3 presented as median fluorescence
intensity (MFI) of viable cells or apoptotic cells (n = 3). e, Inhibition of
CBX-sensitive current in apoptotic cells treated with trovafloxacin (20 µM), as
measured by whole-cell patch-clamp recording (n = 7). f, Patch-clamp
recordings from HEK293T cells expressing Cx43 and receiving indicated 
treatments. Whole-cell current at +80 mV is shown under conditions when 
bath solution was perfused with trovafloxacin (20 µM, blue shading) or
gadolinium (Gd3+) (100 µM, pink shading). g, Current-voltage relationships of
Cx43 current in HEK293T cells treated with or without trovafloxacin (20 µM)
or Gd3+ (100 µM), with the current measured over a range of voltages.
Exemplar traces in f and g are representative of 14 cells per group. 

h, Patch-clamp recordings from HEK293T cells expressing mouse Panx2 and 
receiving indicated treatments. Whole-cell current at +80 mV is shown under 
conditions when bath solution was perfused with trovafloxacin (20 µM, blue
shading) or carbenoxolone (CBX) (50 µM, pink shading). i, Current-voltage
relationships of Panx2 current in HEK293T cells treated with or without
trovafloxacin (20 µM) or CBX (50 µM), with the current measured over a range
of voltages. Exemplar traces in h and i are representative of 4 cells per group. 
j, Trovafloxacin, ciprofloxacin and levofloxacin inhibit bacterial growth. 
Escherichia coli growth (as measured by absorbance at 600 nm) in the presence 
of indicated concentrations of quinolones (n = 3). Error bars represent s.e.m.

Extended Data Figure 2 | Electronic gating strategy for the separation of
different cellular and subcellular populations of Jurkat cells undergoing
apoptosis in vitro. a, Flow cytometric analysis showing that each type of particle
gated (see b below) has a distinctive level of cellular complexity (side scatter,
SSC), cell size (forward scatter, FSC) as well as TO-PRO-3 (indicative of
caspase-mediated activation of pannexin 1 channels), 7-AAD (indicative of
membrane integrity) and annexin V (indicative of phosphatidylserine
exposure) staining. b, Flow cytometry gating strategy used to distinguish viable
cells, annexin V⁻ apoptotic cells, annexin V⁺ apoptotic cells, annexin V⁻
particles, and apoptotic bodies. c, ImageStream analysis of particles gated using
the same strategy as described in b. Representative images for each type of
particle are shown. Jurkat cells were induced to undergo apoptosis by anti-Fas
treatment (2 h) in all indicated experiments.

Extended Data Figure 3 | Inhibition of pannexin 1 promotes the formation
of apoptotic bodies via a mechanism independent of extracellular ATP.

a, CBX and probenecid enhance the generation of apoptotic bodies from cells 
undergoing ultraviolet-induced apoptosis (n = 3). b, Formation of apoptotic 
bodies after treatment with the indicated concentrations of CBX (n = 3). 

The corresponding TO-PRO-3 uptake by annexin V~ apoptotic cells at each 
CBX concentration is shown above the respective bars. c, d, Addition of 
exogenous ATP during apoptosis induction does not inhibit formation of
apoptotic bodies in CBX-treated cells (n = 3) (c) or cells stably expressing
the dominant-negative PANX1 mutant (PANX1 DN mutant) (n = 3) (d). 

e, Removal of extracellular ATP by apyrase does not promote formation of 
apoptotic bodies (n = 3). f, P2Y receptor antagonist suramin does not promote 
formation of apoptotic bodies (n = 3). Jurkat cells were induced to undergo 
apoptosis by anti-Fas treatment in all indicated experiments. Error bars 
represent s.e.m.

Extended Data Figure 4 | Pannexin 1 activity does not affect DNA
fragmentation during apoptosis. a, b, TO-PRO-3 dye uptake (n = 3) (a) and
[…] control vector, the dominant-negative PANX1 mutant (PANX1 DN mutant)
or wild-type PANX1 (PANX1 WT). DNA fragmentation from cells induced to
undergo apoptosis and treated with or without 500 µM CBX is also shown in
b. c, Time-lapse images monitoring TO-PRO-3 dye uptake during progression
of apoptosis in Jurkat cells with normal PANX1 function show that TO-PRO-3
[…] induced to undergo apoptosis by anti-Fas treatment (2 h). Error bars in
a represent s.e.m.


©2014 Macmillan Publishers Limited. All rights reserved 


[Figure: flow cytometry plots and ImageStream images for wild-type thymocytes + Dex (ex vivo).
Gating steps: separate permeabilized cells (7-AAD high) from other events (7-AAD low/intermediate);
separate SSC high and annexin V low/intermediate from other events; separate annexin V⁻ apoptotic
cells (TO-PRO-3 intermediate) from viable cells (TO-PRO-3 low); separate annexin V⁺ apoptotic
cells (FSC high) from apoptotic bodies (FSC low); separate annexin V⁻ particles (annexin V low)
from other events. Axes: FSC, SSC, 7-AAD, annexin V, TO-PRO-3. Populations: viable cells,
permeabilized cells, annexin V⁻ apoptotic cells, annexin V⁺ apoptotic cells, annexin V⁻ particles,
apoptotic bodies. ImageStream channels: bright field, annexin V, 7-AAD, dark field (complexity).]


Extended Data Figure 5 | Electronic gating strategy for the separation of
different cellular and subcellular populations of primary thymocytes
undergoing apoptosis ex vivo. a, Flow cytometry analysis showing that each type
of particle gated according to b has a distinctive level of SSC, FSC as well as
TO-PRO-3 and annexin V staining. b, Flow cytometry analysis showing the
electronic gating strategy used to distinguish viable cells, annexin V⁻ apoptotic
cells, annexin V⁺ apoptotic cells, annexin V⁻ particles, and apoptotic bodies.
c, ImageStream analysis of particles gated using the same strategy as described
in b. Representative images for each type of particle are shown. Primary mouse
thymocytes were induced to undergo apoptosis by dexamethasone (Dex)
treatment in all indicated experiments.

Extended Data Figure 6 | Electronic gating strategy for analysing the
complexity of subcellular apoptotic particles generated ex vivo and in vivo.
a, Flow cytometry analysis showing the electronic gating strategy used to
distinguish annexin V^high, 7-AAD^low subcellular particles generated from
primary mouse thymocytes induced to undergo apoptosis via dexamethasone
treatment. Subcellular apoptotic particles with high complexity (SSC high) or
low complexity (SSC low) are gated as shown. b, Flow cytometry analysis
[…] apoptotic cell-derived particles generated in the thymus of mice injected
intraperitoneally with dexamethasone (6 h). Annexin V^high, 7-AAD […]

Extended Data Figure 7 | Generation of conditional and global pannexin
1-deficient mice. a, Strategy for deletion of neomycin cassette and exon 3 of
Panx1. b, Identification of mice with floxed Panx1 loci, assessed by PCR.
c, mRNA levels of Panx1 in CD4⁺ thymocytes relative to Gapdh. n = 3 mice per
[…] genotypes. e, Identification of mice with wild-type, heterozygous and
homozygous Panx1-targeted loci, assessed by PCR. f, mRNA levels of Panx1 in
thymocytes with the indicated genotypes relative to Gapdh. n = 3 mice per
group. Error bars in c and f represent s.e.m.

Extended Data Figure 8 | Formation of apoptotic bodies but not string-like
apoptopodia structures is dependent on actomyosin contraction.
a, Time-lapse images monitoring apoptotic cell morphology of cells treated
with or without CBX (500 µM) and in the presence of actomyosin contraction
inhibitors. Top right, percentage of apoptotic cells forming string-like
apoptopodia structures (387, 414, 459 and 372 apoptotic cells were analysed
for Y-27632, Y-27632+CBX, Cyto-D and Cyto-D+CBX-treated cells,
respectively, from three independent experiments). b, c, Time-lapse images
monitoring apoptotic cell morphology of cells stably expressing the
dominant-negative PANX1 mutant (PANX1 DN mutant) (b) or treated with
40 µM trovafloxacin (c) in the presence of Cyto-D (5 µM). d, Inhibitors of
blebbing, Y-27632, blebbistatin, or cytochalasin D (Cyto-D) reduce the
formation of apoptotic bodies in Jurkat cells expressing PANX1 DN mutant
(n = 3). e, Generation of apoptotic bodies by dying cells treated with Y-27632
(10 µM), blebbistatin (50 µM) and Cyto-D (5 µM). Cells were induced to
undergo apoptosis in the presence or absence of CBX (500 µM) (n = 3). f, The
enhanced formation of apoptotic bodies in apoptotic thymocytes from mice
with PANX1 deficiency is also blunted by the ROCK inhibitor Y-27632
(10 µM) that blocks membrane blebbing (n = 3). Error bars represent s.e.m.
Scale bars, 5 µm. Arrows, apoptopodia.

Extended Data Figure 9 | Inhibition of pannexin 1 during ultraviolet-
induced apoptosis in LR73 fibroblasts promotes the formation of
membrane protrusions and apoptotic bodies. a, ATP levels in supernatants
of LR73 fibroblasts treated with 40 µM trovafloxacin with or without apoptosis
induction (n = 3). b, Formation of apoptotic bodies (left) and TO-PRO-3
uptake (right) by LR73 fibroblasts treated with the indicated concentrations of
trovafloxacin (n = 3). c, Generation of apoptotic bodies by LR73 fibroblasts
treated with 2 mM probenecid (n = 3). d, Time-lapse images monitoring
apoptotic cell morphology of LR73 fibroblasts treated with or without
trovafloxacin (40 µM) or probenecid (2 mM). LR73 fibroblasts were induced
to undergo apoptosis by ultraviolet treatment in all indicated experiments.
Error bars in a–c represent s.e.m. Arrows, apoptopodia. Scale bars, 10 µm.

Extended Data Figure 10 | Schematic diagram depicting where pannexin 1
likely acts in limiting the fragmentation of apoptotic cells. Blocking PANX1
function (for example via trovafloxacin) leads to formation of apoptopodia, and
subsequently the release of apoptotic bodies.

The remnant of a merger between two dwarf galaxies in Andromeda II

N. C. Amorisco¹, N. W. Evans² & G. van de Ven³

Driven by gravity, massive structures like galaxies and clusters of
galaxies are believed to grow continuously through hierarchical merging
and accretion of smaller systems. Observational evidence of
accretion events is provided by the coherent stellar streams crossing
the outer haloes of massive galaxies, such as the Milky Way¹ or
Andromeda². At similar mass scales, around 10¹¹ solar masses in stars,
further evidence of merging activity is also ample³⁻⁵. Mergers of lower-mass
galaxies are expected within the hierarchical process of galaxy
formation⁶, but have hitherto not been seen for galaxies with less than
about 10⁹ solar masses in stars⁷,⁸. Here we report the kinematic detection
of a stellar stream in one of the satellite galaxies of Andromeda,
the dwarf spheroidal Andromeda II, which has a mass of only 10⁷
solar masses in stars⁹. The properties of the stream show that we are
observing the remnant of a merger between two dwarf galaxies. This
had a drastic influence on the dynamics of the remnant, which is
now rotating around its projected major axis¹⁰. The stellar stream in
Andromeda II illustrates the scale-free character of the formation of
galaxies, down to the lowest galactic mass scales.

Andromeda II is, in size, the second-largest dwarf spheroidal galaxy
known in the Local Group, with a half-light radius¹¹ of about 1.2 kpc
(second only to Andromeda XIX¹²). With a luminosity⁹ at visible
wavelengths of L_V = 9.4 × 10⁶ L☉, where L☉ is the solar luminosity, it is at
present 185 kpc from its host and at a heliocentric distance of about
650 kpc (ref. 13). Among the satellites of M31 (Andromeda), And II is
one of the few for which a spectroscopic data set of hundreds of stars is
available. These observations, made using the Deep Imaging Multi-Object
Spectrograph (DEIMOS) on the Keck II telescope, revealed a
strong and puzzling stellar rotation that is so far unique among the
dwarf spheroidal galaxies of the Local Group¹⁰. The results presented
here are based on a re-analysis of the latter spectroscopic observations,
provided by N. Ho and M. Geha.

We assign reliable probabilities of membership to all spectroscopic 
targets, comprising more than 700 candidate red-giant-branch stars, 
by allowing for the presence of both foreground contaminants from 
the Milky Way and interlopers from the halo population of M31. Each 
of the three coexisting components (members, Milky Way halo con- 
taminants and M31 halo contaminants) is described by a distinct spa- 
tial and kinematical distribution, the parameters of which are measured 
by a maximum-likelihood technique (Methods). A Bayesian approach 
then allows us to estimate a probability of membership for each avail- 
able star: we count 632 high-probability members (P > 0.85; Extended 
Data Fig. 1). 

We study the kinematical properties of And II also by a maximum-
likelihood method, fully taking into account the observational uncertainties
in the line-of-sight velocity of each available giant star in the
spectroscopic sample. We find that, within measurement errors (median
value, 6.5 km s⁻¹), there are no significant deviations from the strong
rotation field of And II. However, despite such a smooth mean-velocity
field and an otherwise flat velocity dispersion profile (Extended Data
Fig. 2), we identify a drop and asymmetries in the velocity dispersion
field, especially within the circular annulus 0.9 kpc < R < 1.7 kpc.


To quantify the significance of these kinematic anomalies, we isolate 
a group of stars that is defined based only on its spatially connected 
location. Figure 1a displays the giant stars identified as high-probability 
members in the spectroscopic data set, superimposed on a Subaru Prime 
Focus Camera (Suprime-Cam) image’ of And II. We select 134 stars 
over an annular, stream-like region covering an angle of 270° over the 
body of And II (blue points). These are compared with a control sam- 
ple of 319 stars, comprising the remainder of the spectroscopic targets 
at comparable distances from the centre of And II (red points). Although 
sharing a compatible rotational field, the stream-like region is found 
to be kinematically colder (Fig. 1b, c). Figure 1b shows histograms of 
the line-of-sight velocity distribution of the available giant stars in 
both regions, and Fig. 1c shows the normalized generalized histograms 
obtained after subtracting the mean stellar rotation field. Figure 1d 
shows the probability distribution functions of the projected velocity
dispersion, σ. The blue dashed curves in Fig. 1b–d refer to the stream-like
region, and the red curves are for the control sample. The probability
that the velocities of the giant stars in the two described regions
have been extracted from the same parent line-of-sight velocity dis- 
tribution is negligible (P< 3 X 10~°). This shows that, together with 
the stellar population of And II, an additional kinematically colder 
component contributes a substantial fraction of the stars in the selected 
annulus. 

Among the stars in the latter spatially connected stream-like region,
we next use a Bayesian approach to identify those that are significantly
better described by the properties of the control sample with a higher
velocity dispersion, σ = 9.3 ± 0.6 km s⁻¹. These are probably And II stars,
and we find 14 such high-probability contaminants (P > 0.85; Fig. 1a,
blue points). If we exclude these, the remaining 120 stars (filled blue dots)
are characterized by a much lower velocity dispersion (σ < 3 km s⁻¹),
the probability distribution of which is shown by the blue full curve in
Fig. 1d. We identify these stars with a stream that extends coherently
over a distance of 5 kpc, with an average thickness of 0.3 kpc.

The stellar stream contains at least one-tenth of the luminosity of
And II, which allows us confidently to put a lower bound of L_V ≳ 10⁶ L☉
on the total luminosity of the progenitor. Furthermore, current photometric
data do not suggest that the colour spread of the red-giant-
branch population of the stream is dissimilar from the considerable 
spread of the And II population itself (Extended Data Fig. 3). Together, 
these point to the progenitor of the stream being a dwarf galaxy with a 
total mass not too different from that of And II. A progenitor with such 
properties is not unexpected: recent kinematic studies of members of 
the Local Group have identified analogue dwarf galaxies with compar- 
ably low velocity dispersions around both the Milky Way”* and M31”. 

The merger had a substantial influence on the dynamics and struc- 
ture of the remnant. The colder stream and warmer control sample do 
not show statistically significant differences in the properties of their 
rotation, which may seem somewhat odd within the merger context. 
However, it is very likely that the orbital angular momentum of the mer- 
ging dwarfs was much larger than the intrinsic net angular momentum 
of the stars in either of the dwarfs. Combined with a mass ratio expected 

Figure 1 | Kinematic detection of a stream in
And II. The targets in the circular annulus
0.7 kpc < R < 1.9 kpc are divided into
kinematically cold and warm components.
The 134 stars in the connected stream-like region
(a, blue points) yield the blue-dashed line-of-sight
velocity, v_los, distributions in b and c (respectively
before and after normalization, subtraction of
the mean stellar rotation field and convolution
with the individual measurement uncertainties),
and the blue-dashed probability distribution for
the projected velocity dispersion, σ, in d. Red
distributions are associated with the kinematically
warmer control sample, comprising the remaining
319 spectroscopic targets (a, red points). The 14
blue open points in a indicate stars that are more
likely to belong to the main body of And II rather
than to the stream itself. When they are subtracted
from the sample of stream stars, the internal
velocity dispersion of the stream is reduced further,
leading to the blue full distributions in b–d.

to be not too far from unity, the torque exerted by the merging dwarf
would have been substantial. This is most probably responsible for stars 
of the stream and control sample having the same rotation strength, as 
well as the puzzling orientation around the major axis—stellar rotation 
is nearly always around the minor axis, consistent with oblate axisym- 
metry (except for rotation around the major axis in some giant elliptical 
galaxies caused by triaxiality). At the same time, it is likely that more 
stars that once belonged to the stream’s progenitor are recorded in the 
spectroscopic data set but cannot be clearly distinguished from the stellar 
population of And II because of their lower density contrast. The detec- 
tion of such a stream also provides a natural explanation for the peculiar 
extended component of old stars’ in And II with an effectively constant 
density out to a large radius of about 1.9 kpc; the merger has dynam- 
ically heated the remnant’s stellar population. Although a particularly 
close tidal encounter'® with M31 may also have contributed to shaping 
the structure of And II, this remains very uncertain given the galaxy’s 
unknown proper motion and consequent degeneracy in the modelling”’. 
We measure the line-of-sight velocity of the stream as well as its 
projected spatial position onto the body of And II, and use this informa- 
tion to constrain its approximate orbit. As Fig. 2 shows, a simple model 
of an orbit in a spherical potential is capable of describing the available 
velocity and position measurements. Although it is not possible to infer 
the detailed properties of the gravitational potential of And II, because 
the orbit is found to be almost circular, we are able to constrain the 
enclosed mass (stellar and dark matter) interior to the stream as follows:
M(<1.5 kpc) ≈ 2.5 × 10⁸ M☉. This implies a mass-to-light ratio
of about 45 M☉/L☉,V, which is typical of dwarf spheroidal galaxies¹⁷,¹⁸.
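
As a rough cross-check of the numbers above, the following sketch applies the textbook circular-orbit relation M(<r) = v²r/G. It is not the authors' orbit-fitting procedure. The central values 2.5 × 10⁸ M☉, 1.5 kpc and a mass-to-light ratio of 45 are read from the text; the function names and the unit constants are assumptions introduced here for illustration.

```python
import math

# Assumed constants (standard values, not taken from the article)
G = 4.30091e-6             # gravitational constant in kpc (km/s)^2 / Msun
KM_PER_KPC = 3.0857e16     # kilometres in one kiloparsec
SECONDS_PER_GYR = 3.15576e16

def circular_velocity(mass_msun, radius_kpc):
    """Circular speed (km/s) at radius_kpc for an enclosed mass in solar masses."""
    return math.sqrt(G * mass_msun / radius_kpc)

def enclosed_mass(velocity_kms, radius_kpc):
    """Enclosed mass (Msun) implied by a circular orbit of given speed and radius."""
    return velocity_kms ** 2 * radius_kpc / G

def orbital_period_gyr(velocity_kms, radius_kpc):
    """Time for one circular orbit, in Gyr."""
    circumference_km = 2.0 * math.pi * radius_kpc * KM_PER_KPC
    return circumference_km / velocity_kms / SECONDS_PER_GYR

mass = 2.5e8      # Msun, central value quoted in the text
radius = 1.5      # kpc, radius quoted in the text
v_circ = circular_velocity(mass, radius)
period = orbital_period_gyr(v_circ, radius)
light_enclosed = mass / 45.0   # Lsun implied by the quoted mass-to-light ratio

print(f"v_circ ~ {v_circ:.1f} km/s, period ~ {period:.2f} Gyr, "
      f"L(<1.5 kpc) ~ {light_enclosed:.2e} Lsun")
```

Under these assumptions the quoted mass corresponds to a circular speed of roughly 27 km s⁻¹ and an orbital period of about a third of a gigayear, so a stream on such an orbit would have completed several revolutions over a few gigayears.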
Even with a characteristic orbital velocity for the stream, dating the 
epoch of the And II merger remains challenging. The survival of cold 
kinematic clumps is strongly dependent on the properties of the grav- 
itational potential in which they orbit'*’°. Nevertheless, the And II 
stream seems to lie on a nearly circular orbit and never passes close
to the central regions of the galaxy, which allows it to retain coherence 
for a very long time. At the same time, dynamical friction can drag two 
mutually orbiting dwarf galaxies closer, causing a merger in just a 
fraction of the Hubble time. We estimate that this process requires 
= 3 Gyr, which provides us with an approximate lower limit to assoc- 
iate with the merger of the And II system. 

Streams of disrupted and engulfed galaxies are abundant in the halo 
of the Milky Way, as memorably shown in the Sloan Digital Sky Survey’s 
‘Field of Streams” and by the disrupting Sagittarius galaxy’’. However, 
the frequency and role of accretion onto low-mass galaxies, and in 
particular of dwarf—dwarf mergers, remains unclear, given the extremely 
limited observational evidence. Stellar overdensities similar to those of 
shells have been discovered in the Fornax dwarf spheroidal galaxy”, 
suggesting a late merger origin’’. Irregular isophotes**” and a kine- 
matically cold spot’? indicate that Ursa Minor has suffered recent dis- 
turbance, most probably the accretion of a lower-mass stellar system in 
the form of a stellar cluster. In this respect, And II represents a com- 
pelling case of a dwarf—dwarf merger. 

Mergers between low-mass galaxies are predicted within the hier- 
archical framework of galaxy formation, but they are rare at present 
times. This is particularly true for dwarf satellite galaxies: after accre- 
tion onto their host, in this case M31, the cross-section for encounters 
between previously unrelated dwarf galaxies is very low’, implying 
that subsequent merging activity is essentially limited to galaxies that 
were closely associated before infall. This makes the discovery of a tidal 
stream, originating from the engulfment of one dwarf satellite by another, 
particularly remarkable. As for merger events preceding infall, it is 
estimated® that one in two dark matter haloes with a virial mass of 
10¹⁰ M☉ have experienced a major merger (mass ratio ≳ 1/3) between
redshift 4 and 1, but data to confirm these figures are exceedingly scarce. 

Figure 2 | The stream reproduced by an orbit in a spherical potential.
a, Spectroscopic targets belonging to the stream, colour-coded using a
smoothed velocity field for comparison with the best-fitting orbit. b, c, Line-of-
sight velocity (b) and distance from the centre of And II (c) of the stars
belonging to the stream, reproduced using a simple model of an orbit in a
spherical potential. These data provide a direct comparison between the
observables (each data point was obtained from a subset of ten stream stars;
centre values, average; errors, s.d.) and the corresponding 68% confidence
region obtained from the model.


Andromeda II provides direct evidence for the importance of mer- 
gers even for the smallest and least luminous of galaxies. Just as for the 
largest giant ellipticals, merging and accretion were dominant processes 
in the formation of the dwarf galaxies we see today. 


METHODS SUMMARY 


Spectroscopic targets that belong to the red-giant-branch population of And II are 
separated from Milky Way dwarf stars in the foreground and interlopers from the 
stellar halo of Andromeda by using maximum-likelihood and Bayesian techniques. 
The three distinct populations have different spatial and kinematical distributions, 
the properties of which are determined by the maximum-likelihood method. The 
extraction of the kinematics of the member stars is also obtained using a max- 
imum-likelihood technique. This includes a full treatment of the individual obser- 
vational uncertainties in the line-of-sight velocity of each spectroscopic target and 
allows for the subsequent kinematic detection and characterization of the stream. 
The luminosity estimate of the identified kinematically cold structure and inferred 
constraints on its progenitors are obtained by using the detailed surface density 
profile of And II, together with the projected region where the density contrast of 
the stream is highest. 

Online Content Any additional Methods, Extended Data display items and Source
Data are available in the online version of the paper; references unique to these
sections appear only in the online paper.

METHODS


Membership selection. To address the kinematical properties of And II, we need 
to separate the spectroscopic targets that belong to the red-giant-branch popu- 
lation of the dwarf galaxy from any contaminants. Contamination has two different 
origins: dwarf stars in the foreground belonging to the Milky Way and interlopers 
from the stellar halo of Andromeda. To disentangle these three distinct compo- 
nents reliably, we model the spectroscopic data set as a superposition of multiple 
independent stellar populations, within the framework of a maximum-likelihood 
technique”*”*: 


\[ L(\theta) = \prod_j \sum_i f_i \, P_i^{\mathrm{sp}}(\mathbf{x}_j; \theta_i^{\mathrm{sp}}) \, P_i^{\mathrm{kin}}(v_j; \theta_i^{\mathrm{kin}}) \]

The index j runs over the spectroscopic targets, and i indicates the three
populations of the model, each containing a fraction of stars f_i, with the constraint
Σ_i f_i = 1. The probability function P_i^sp, parameterized by the set of parameters
θ_i^sp, describes the spatial distribution of the members of the component i on the
plane of the sky. Given the limited angular size of And II, we can adopt a constant
surface density distribution for both populations of contaminants, and use a
Plummer density profile with elliptical isophotes to describe the population of
And II itself. Analogously, the three components have different probability
distributions for their kinematics, which are respectively described by the functions
P_i^kin and the parameters θ_i^kin. Given the significant separation of the contaminants
from And II in terms of systematic velocity, we can describe both the Milky Way
and the Andromeda halo population with a simple Gaussian line-of-sight velocity
distribution. The observational uncertainty in the measurement of the line-of-sight
velocity of each single spectroscopic target, δ_j, is fully included in the analysis
by formal convolution; hence, for the contaminants, we have

where Milky Way and Andromeda contaminants have their own systematic velocity,
v_i^sys, and intrinsic velocity dispersion, σ_i. The velocity distribution of And II is
also normally distributed, but has a rotating mean velocity field, with frequency Ω:

The parameter space is explored by means of a suite of Monte Carlo chains,
constructed using the Metropolis–Hastings algorithm”. Once the best-fitting model is
identified, for example as described by the set of parameters θ̂, a Bayesian approach
allows us to estimate, for each spectroscopic target j, the probability of that
individual star belonging to the population i:

Extended Data Fig. 1 shows the distribution of all spectroscopic targets in the plane
(v_los, R), where R is the projected radial distance from the centre of And II. Colour-
coding of each target is set by its probability of membership of And II: the combination
of spatial position and kinematical information is capable of efficiently
selecting the members of And II. As a further check, we also exclude a few targets
that appear as fortuitous ‘kinematic members’, but whose membership of And II is
questioned” either by their colour (V − I > 2.5) or by the strength of the Na I line
(EW_NaI > 4), which indicate that these may in fact be dwarf foreground stars from
the Milky Way halo. As a result, we identify 632 giant stars belonging to And II with
high probability (P > 0.85).
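
The membership step described above can be sketched as a standard mixture-model posterior: each population's weight is its fraction times its spatial and kinematic densities, normalized over the three populations. This is a minimal illustration under stated assumptions, not the authors' code. The function names are hypothetical, and all numerical values are placeholders except the systematic velocity of And II (−191.4, taken here to be in km s⁻¹), the 9.3 km s⁻¹ dispersion and the 6.5 km s⁻¹ median error quoted in the text.

```python
import math

def convolved_gaussian(v, mean, sigma, delta):
    """Gaussian velocity PDF with intrinsic dispersion sigma,
    convolved with a Gaussian measurement error delta."""
    var = sigma ** 2 + delta ** 2
    return math.exp(-0.5 * (v - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)

def membership_probabilities(v, delta, spatial, fractions, means, sigmas):
    """Posterior probability that one star belongs to each population.

    spatial[i] is the spatial PDF of population i evaluated at the star's position.
    """
    weights = [f * s * convolved_gaussian(v, m, sig, delta)
               for f, s, m, sig in zip(fractions, spatial, means, sigmas)]
    total = sum(weights)
    return [w / total for w in weights]

# Populations: And II, Milky Way foreground, M31 halo.
# Fractions, contaminant velocities/dispersions and spatial densities are
# hypothetical placeholders chosen only to make the example run.
fractions = [0.85, 0.10, 0.05]
means = [-191.4, -50.0, -300.0]   # km/s
sigmas = [9.3, 40.0, 90.0]        # km/s
spatial = [1.0, 1.0, 1.0]         # equal spatial densities at this position

p = membership_probabilities(v=-190.0, delta=6.5, spatial=spatial,
                             fractions=fractions, means=means, sigmas=sigmas)
is_member = p[0] > 0.85           # threshold used in the text
print([round(x, 4) for x in p], is_member)
```

A star near the systematic velocity of And II is assigned to the dwarf with high probability, whereas a star far from it is shared between the two contaminant populations according to their fractions and dispersions.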

Kinematic extraction. Despite significant observational effort and use of the
DEIMOS spectrograph on Keck, the heliocentric distance of And II, D ≈ 650 kpc,
is so great that uncertainties in the measurement of the line-of-sight velocity of each
target are significantly higher than figures achieved for the closer Milky Way dwarf
satellites (δ_v ≈ 2 km s⁻¹). In particular, with a median of about δ_v ≈ 6.5 km s⁻¹,
measurement errors are comparable in magnitude to the kinematical spread due to
the intrinsic velocity dispersion of the dwarf, which implies that appropriate
treatment of these uncertainties is crucial.
For this reason, we extract the kinematic properties of And II by fully taking into 
account all measurement errors, individually for each spectroscopic target. First, 
we adopt the line-of-sight velocity distribution of equation (1) to infer the intrinsic 
velocity dispersion, σ, in circular annuli as displayed in Extended Data Fig. 2. The
resulting σ profile is flat, as is typically observed in dwarf spheroidal galaxies,
except for a significant dip around R ≈ 1.3 kpc, independently of the precise choice
of the circular annuli.
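
To illustrate why the individual errors matter, the sketch below estimates an intrinsic dispersion by maximizing a Gaussian likelihood in which each star's variance is the sum of the intrinsic variance and its own measurement variance. It assumes a known mean velocity and uses synthetic data and a simple grid search; the function names and all numbers are hypothetical and do not reproduce the authors' fit.

```python
import math
import random

def log_likelihood(sigma, velocities, errors, mean):
    """Log-likelihood of intrinsic dispersion sigma given per-star errors."""
    total = 0.0
    for v, d in zip(velocities, errors):
        var = sigma ** 2 + d ** 2   # intrinsic variance plus this star's error variance
        total += -0.5 * math.log(2.0 * math.pi * var) - 0.5 * (v - mean) ** 2 / var
    return total

def fit_dispersion(velocities, errors, mean, grid):
    """Return the grid value of sigma that maximizes the likelihood."""
    return max(grid, key=lambda s: log_likelihood(s, velocities, errors, mean))

# Synthetic sample: true dispersion 9 km/s, errors between 4 and 9 km/s
rng = random.Random(0)
true_sigma = 9.0
errors = [rng.uniform(4.0, 9.0) for _ in range(2000)]
velocities = [rng.gauss(0.0, math.sqrt(true_sigma ** 2 + d ** 2)) for d in errors]

grid = [0.1 * k for k in range(1, 301)]   # 0.1 to 30.0 km/s
sigma_hat = fit_dispersion(velocities, errors, 0.0, grid)

# Naive estimate that ignores measurement errors, for comparison
naive = math.sqrt(sum(v * v for v in velocities) / len(velocities))
print(f"maximum-likelihood sigma = {sigma_hat:.1f}, naive rms = {naive:.1f}")
```

The naive root-mean-square spread is inflated by the measurement errors, whereas the likelihood that includes each error individually recovers a value close to the input dispersion.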

Next we applied the same technique to stars in the circular annulus with 0.9 kpc
< R < 1.7 kpc, but now allowing for two spatially connected components with a
different intrinsic velocity dispersion. This results in the identification of the 
stream-like region indicated by blue circles in Fig. 1a. The convolution of the 
intrinsic line-of-sight velocity distribution with the error function of each target, 
which is assumed to be normally distributed, is illustrated in Fig. 1c, where we 
show normalized generalized histograms for the velocity distributions of both the 
stream and the control sample. Only by explicitly including each measurement 
error individually is it then possible to infer the probability distributions of the 
intrinsic dispersion for the stream and the control sample (Fig. 1d). Well over 10° 
draws in a suite of Monte Carlo chains have been used to sample the tails of these σ
distributions accurately. This technique also ensures that these σ distributions are
properly marginalized against uncertainties in all other parameters of the model. 
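
The sampling of a dispersion posterior with a Metropolis–Hastings chain can be sketched as follows. This is a one-parameter toy version with a flat prior on σ > 0 and synthetic residual velocities; the real analysis samples many parameters jointly. The function names, step size, chain length and data are assumptions made for illustration only.

```python
import math
import random

def log_posterior(sigma, residuals, errors):
    """Log-posterior of the intrinsic dispersion under a flat prior on sigma > 0."""
    if sigma <= 0.0:
        return float("-inf")
    total = 0.0
    for r, d in zip(residuals, errors):
        var = sigma ** 2 + d ** 2   # intrinsic spread plus this star's error
        total -= 0.5 * (math.log(2.0 * math.pi * var) + r ** 2 / var)
    return total

def metropolis_hastings(residuals, errors, n_steps, step, start, rng):
    """Random-walk Metropolis-Hastings chain for sigma."""
    chain = []
    current = start
    current_lp = log_posterior(current, residuals, errors)
    for _ in range(n_steps):
        proposal = current + rng.gauss(0.0, step)
        proposal_lp = log_posterior(proposal, residuals, errors)
        # Accept with probability min(1, posterior ratio)
        if rng.random() < math.exp(min(0.0, proposal_lp - current_lp)):
            current, current_lp = proposal, proposal_lp
        chain.append(current)
    return chain

# Synthetic cold component: 120 stars, true dispersion 3 km/s, errors 5 to 8 km/s
rng = random.Random(1)
errors = [rng.uniform(5.0, 8.0) for _ in range(120)]
residuals = [rng.gauss(0.0, math.sqrt(3.0 ** 2 + d ** 2)) for d in errors]

chain = metropolis_hastings(residuals, errors, n_steps=5000, step=1.0,
                            start=5.0, rng=rng)
samples = sorted(chain[1000:])                    # discard burn-in
upper_95 = samples[int(0.95 * len(samples))]      # one-sided 95% upper bound
print(f"95% upper bound on sigma: {upper_95:.1f} km/s")
```

Because the measurement errors exceed the intrinsic dispersion in this toy sample, the posterior is broad and skewed towards zero, which is why an upper bound is a more natural summary than a central value.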
Stream luminosity and progenitor. Our kinematical analysis identifies N_str = 120
red giant stars as high-probability members of the kinematically cold stream, in a
pool of N_mem = 632 members of And II. Although indicative of the significant
luminosity of this structure, it is not entirely correct to use simply the ratio
N_str/N_mem to derive an estimate of the luminosity of the stream itself. This is
because the stream covers a limited area over the body of And II and the
spectroscopic coverage is neither uniform over the body of the dwarf nor proportional to
its surface brightness profile.

We can instead obtain more accurate insight into the luminosity of the stream 
by restricting ourselves to the projected region it covers. This has been identified 
in Fig. 1a as an approximately annular region, centred around R ~ 1.35 kpc and 
covering an angle of ~270°. By using the detailed surface density profile’ of And II, 
obtained by using deep Suprime-Cam data, we calculate that the projected area 
contains ~ 15% of the total luminosity of the dwarf. In turn, we find only 14 high- 
probability contaminants in this region, that is, stars that are significantly more 
likely to belong to the And II population than to the kinematically cold stream. 
This implies a luminosity estimate for the stream of

\[ L_{\mathrm{str},V} \approx 0.13\, L_{\mathrm{And\,II},V} \approx 1.3 \times 10^{6}\, L_\odot \qquad (2) \]

It is worth mentioning that, on the basis of kinematical data alone, we are inevitably
capable of identifying only those parts of the stream that have a sufficiently
high density contrast relative to the And II population, and whose kinematics in
the stream’s progenitor were cold enough to stand out against the average present
properties of And II. In particular, as a result of the merger, it is plausible that any
stellar components that were originally more diffuse in the stream’s progenitor are
now dispersed in And II, eluding purely kinematical detection. This implies that
the above 13% (equation (2)) should be regarded as a lower limit for the luminosity
of the progenitor itself.

Whereas the resulting luminosity already makes a stellar cluster origin unlikely,
this can be excluded further by considering the properties of the distribution of
the stream’s members in the colour-magnitude diagram. Extended Data Fig. 3
follows the same stream versus control sample labelling as in Fig. 1a, to allow for
the comparison of the properties of the stream’s member stars with those of the
And II population. We find that these data suggest little difference in the
distribution of the two populations in the colour-magnitude diagram, highlighting the
fact that the stream also has a significant colour spread.
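
One way to read the 13% figure in equation (2) is as the share of light in the annular region (about 15% of And II) scaled by the fraction of stars in that region attributed to the stream (120 of 134). The text does not spell out this step, so the scaling below is an assumption; the input numbers are those quoted in the text and the variable names are hypothetical.

```python
# Numbers quoted in the text
n_region = 134                 # stars in the stream-like region
n_contaminants = 14            # high-probability And II stars in that region
region_light_fraction = 0.15   # share of And II's luminosity in the region
l_and2 = 9.4e6                 # V-band luminosity of And II in solar units

# Assumed scaling: light in the region times the stream's share of its stars
n_stream = n_region - n_contaminants
stream_fraction = region_light_fraction * n_stream / n_region
l_stream = stream_fraction * l_and2

print(f"stream fraction ~ {stream_fraction:.2f}, L_stream ~ {l_stream:.2e} Lsun")
```

With these inputs the stream carries about 13% of the luminosity of And II, a little over 10⁶ L☉, in line with equation (2) and with the lower bound of roughly one-tenth quoted in the main text.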
Extended Data Figure 1 | Membership selection. The spectroscopic data set
in the plane ($v_{\mathrm{los}}$, $R$), shaded according to the probability of each target
belonging to the stellar population of And II. Non-member targets with
velocities higher than the systemic velocity of And II, $v_{\mathrm{sys}} = -191.4 \pm 0.4$, are
foreground contaminants from the Milky Way, whereas non-member targets at
more negative velocities are interlopers from the Andromeda halo.
Extended Data Figure 2 | Velocity dispersion profile. Andromeda II has an
approximately flat velocity dispersion profile, except for a significant dip near
the average projected radius of the stellar stream. Points of different sizes
and shading depths refer to different circular annuli sizes (as in the key),
and error bars display 68% confidence levels around the most likely central
value.
Extended Data Figure 3 | Colour-magnitude diagram. The distribution of
the stars belonging to the stream (blue points) and stars in the control sample
(red points) in V-band magnitude versus V − I colour.
Rotationally driven ‘zebra stripes’ in Earth’s inner radiation belt

A. Y. Ukhorskiy, M. I. Sitnov, D. G. Mitchell, K. Takahashi, L. J. Lanzerotti & B. H. Mauk


Structured features on top of nominally smooth distributions of 
radiation-belt particles at Earth have been previously associated with 
particle acceleration and transport mechanisms powered exclusively 
by enhanced solar-wind activity. Although planetary rotation is
considered to be important for particle acceleration at Jupiter and
Saturn, the electric field produced in the inner magnetosphere
by Earth’s rotation can change the velocity of trapped particles by 
only about 1-2 kilometres per second, so rotation has been thought 
inconsequential for radiation-belt electrons with velocities of about 
100,000 kilometres per second. Here we report that the distributions 
of energetic electrons across the entire spatial extent of Earth’s inner 
radiation belt are organized in regular, highly structured and unex- 
pected ‘zebra stripes’, even when the solar-wind activity is low. Mod- 
elling reveals that the patterns are produced by Earth’s rotation. 
Radiation-belt electrons are trapped in Earth’s dipole-like magnetic 
field, where they undergo slow longitudinal drift motion around the 
planet because of the gradient and curvature of the magnetic field. 
Earth’s rotation induces global diurnal variations of magnetic and 
electric fields that resonantly interact with electrons whose drift period 
is close to 24 hours, modifying electron fluxes over a broad energy 
range into regular patterns composed of multiple stripes extending 
over the entire span of the inner radiation belt. 

Radiation belts comprise electrons and ions trapped in the near-Earth 
region (distance from the Earth’s centre < 6 Earth radii, $R_E$) where the
planetary magnetic field is approximately dipolar. The energy range
of trapped particles spans multiple orders of magnitude, from about
10 kilo-electron volts (keV) to multiple tens of mega-electron volts (MeV),
comprising respectively energetic particles, and relativistic electrons
and penetrating ions. The electron population is divided into the inner
and the outer belts by a so-called slot region where particle loss mechanisms
dominate and the electron fluxes are depleted.

Measurements from the RBSPICE instruments on board the twin
Van Allen Probes spacecraft mission show that the lower-energy electrons
(<1 MeV) across the entire inner radiation belt (<3 $R_E$) are organized
in regular patterns, which we refer to as ‘zebra stripes’ (Fig. 1,
Extended Data Fig. 1). Similar patterns were previously inferred from
low-altitude electron measurements and interpreted as a manifestation
of storm-time geomagnetic activity. It was suggested that the patterns
are produced by a resonant interaction of energetic electrons with mono- 
chromatic ultralow-frequency (ULF; ~1 mHz) perturbations of Earth’s 
magnetic field related to enhanced solar-wind driving, and that in order 
to create multiple zebra stripes the perturbations must be composed of 
multiple wave modes with discrete spatial structure. 

From the analysis of a continuous 6-month interval of the RBSPICE 
measurements, we conclude that these zebra stripes are ubiquitous fea- 
tures in the inner radiation belt. The patterns were observed during all 
spacecraft orbits and appeared to be more regular during quiet geomag- 
netic conditions (quiet times) than during geomagnetic storms (Fig. 1). 
Considering the ULF hypothesis, we note that during average geomag- 
netic conditions global monochromatic ~1 mHz perturbations in the 
magnetic field are not typically observed at inner-belt altitudes. The perturbations
in this frequency range are directly driven by variations in
the solar-wind flow, and these variations have broad-band turbulent
spectra. The disturbances propagate into the inner belt as magnetosonic
waves that typically do not exhibit discrete spatial structure.
A new explanation is needed for the regular organization of the inner- 
belt electrons that must be viable for both quiet and active geomag- 
netic conditions in the absence of monochromatic ULF wave activity. 

Trapped electrons undergo slow longitudinal drift motion around 
Earth as a consequence of the gradient and curvature of its nearly dipole 
magnetic field. The drift frequency, $\omega_D$, depends on the electron energy,
$K$, and the location measured by $L$ (the radial distance in $R_E$ in the
equatorial plane) as $\omega_D \propto KL$. The energy along the individual zebra
stripes in the electron energy spectra depends on radial distance as
$K \propto 1/L$, which means that the electron drift frequency along each
stripe is constant. This is further supported by the distribution of par- 
ticle pitch angles (the angle between a particle’s velocity and the direc- 
tion of the magnetic field) at constant energy values along the stripes. 
The distributions are curved towards larger L values for more field- 
aligned particles, following the dependence of the drift frequency on 
the equatorial pitch angle (see Methods and Extended Data Fig. 1).
The organization of stripes along constant values of the longitudinal 
drift frequency suggests that the zebra stripes are produced in a global 
process that modulates the electron longitudinal drift motion across 
the entire inner belt. 
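
As a rough numerical check of the scaling above, the following sketch evaluates the gradient-curvature drift frequency of an equatorially mirroring electron in a dipole field. It assumes the standard textbook drift formula in SI units and uses the surface field and Earth radius quoted in the Methods; the function names and the sample stripe K = 60 keV / L are illustrative choices, not values from the paper.

```python
import math

B0 = 3.11e-5      # T, equatorial surface field (3.11 x 10^4 nT, as quoted in the Methods)
RE = 6.4e6        # m, Earth's radius (6.4 x 10^3 km, as quoted in the Methods)
MC2_EV = 511.0e3  # eV, electron rest energy (standard constant, not from the paper)


def drift_frequency(K_eV, L):
    """Angular drift frequency (rad/s) of an equatorially mirroring electron.

    Assumed textbook form: omega_D = 3 L p^2 / (2 gamma m e B0 RE^2),
    which reduces to omega_D = 3 K L / (e B0 RE^2) for K << m c^2.
    """
    gamma = 1.0 + K_eV / MC2_EV
    # p^2 / (2 gamma m) expressed in eV; tends to K in the non-relativistic limit
    k_eff = (gamma ** 2 - 1.0) * MC2_EV / (2.0 * gamma)
    return 3.0 * L * k_eff / (B0 * RE ** 2)


def drift_period_hours(K_eV, L):
    """Drift period in hours for kinetic energy K_eV (eV) at radial distance L."""
    return 2.0 * math.pi / drift_frequency(K_eV, L) / 3600.0


if __name__ == "__main__":
    # Hypothetical stripe K = 60 keV / L: the drift period stays nearly constant
    for L in (1.5, 2.0, 2.5):
        K = 60.0e3 / L
        print(f"L = {L:.1f}, K = {K / 1e3:.0f} keV, "
              f"T_drift = {drift_period_hours(K, L):.2f} h")
```

Along such a stripe the computed drift period changes by only a few per cent, the residual variation coming from the relativistic correction to the simple $\omega_D \propto KL$ scaling.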

The only global mechanism in the inner magnetosphere that is inde- 
pendent of solar-wind activity is the rotation of Earth. The rotation induces 
an electric field that controls cold magnetospheric plasma within a <4 $R_E$
region around Earth, referred to as the co-rotation region (Fig. 2). Outside
the co-rotation region, the plasma is controlled by the electric field
produced by the interaction of the solar-wind flow with the planetary
magnetic field. Owing to an ~11° offset between the magnetic dipole
axis and the planetary rotation axis, the entire co-rotation region oscil- 
lates relative to the outer magnetosphere with a period of 24h. This 
relative motion causes global diurnal perturbations of the magnetic field 
over the entire co-rotation region. Because the inner-belt electrons are 
embedded inside the co-rotation region, their drift motion around Earth 
is influenced by the inductive electric field produced by these diurnal 
perturbations. 

To understand how the diurnal perturbations can affect the distribution
of energetic electrons in the inner belt, we used a test-particle model
(see Methods for details). Because the longitudinal drift motion of the
inner-belt electrons is well decoupled from their gyro-motion and bounce
motion, test-particle trajectories were computed in the guiding centre
approximation (see Methods) and consideration was restricted to par- 
ticles bouncing at the magnetic equator. Electrons with initially smooth 
energy and radial distributions drifting in a dipole magnetic field were 
subjected to global diurnal oscillations of the electric field. By slowly 
varying the energy and radial position of individual particles, the elec- 
tric field gradually modified the electron distribution into a pattern of 
zebra stripes (Fig. 1d and Supplementary Video 1) that is remarkably 
similar to observations (Fig. 1a-c).

Figure 1 | Zebra stripe patterns in energetic electron distributions from the
inner radiation belt. a-g, Examples of energetic electron spectra measured
by RBSPICE A, averaged over 45 s, in the inner radiation belt during quiet times
(a-c) and during geomagnetic storms (e-g), parameterized by the 9-h averaged
geomagnetic activity ($D_{st}$) index, compared with simulated spectra from a
test-particle radiation-belt model (d and h).

Figure 2 | Global diurnal oscillations of the electric and magnetic fields
produced by Earth’s rotation. Owing to an offset between Earth’s rotation axis
and the magnetic dipole axis, the co-rotation region (green at one time, purple
12 h later) oscillates relative to the outer magnetosphere (blue) with a period
of 24 h. This relative motion of two domains of the magnetized plasma
(indicated by a black double-headed arrow) results in diurnal perturbations of
the magnetic and the inductive electric fields over the entire co-rotation region
that affect the inner-belt electrons (orange) embedded in the co-rotation region.

The model shows that the single monochromatic diurnal perturbation
of the electric field produces a pattern of multiple seemingly regular
zebra stripes that extends over a broad energy range (~20-250 keV).
This result is surprising, because it was previously believed that multiple
stripes could be formed only if the perturbation spectrum consisted of
multiple resonant harmonics. The physical mechanism of zebra stripe
production is illustrated in Fig. 3. The electron drift motion conserves
the first adiabatic invariant (the magnetic moment, $\mu \propto KL^3$) associated
with the fast gyration of particles perpendicular to the magnetic
field. The change in electron energy is therefore uniquely related to
the change in its radial position. The phase space of the electron drift
motion at a given value of the first invariant is specified by its energy $K$
and its longitude, measured by the azimuthal angle $\varphi$. In the absence of
perturbations in the field, electron energy is conserved and electrons 
drift on circular orbits at constant frequencies defined by their energy. 
A monochromatic diurnal perturbation in the electric field modifies the 
phase space motion. Particles with a drift period of ~24 h are trapped
at resonance with the perturbation in the field. Their drift phases and
energies oscillate about the resonance values, forming a phase space
island bounded by a separatrix. The phase space trajectories outside the
separatrix correspond to non-resonant particles. Over the course of its 
drift period a non-resonant particle has to flow around the resonant island, 
which involves a variation in its energy and a change in the drift fre- 
quency. The magnitude of the change in energy and the instantaneous 
drift frequency depend on the particle’s proximity to the separatrix. 
Consequently, non-resonant electrons with initially identical values of 
energy and radial position but uniformly distributed in the drift phase 
undergo phase mixing, which has a similar effect to the ‘stretching and 
folding’ found in chaotic maps (Fig. 3b, Supplementary Video 2, Extended
Data Fig. 2). 
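
To make the link between energy, radial position and the 24-h resonance concrete, the sketch below follows the constant first-invariant surface labelled K = 30 keV at L = 2 (the surface used in Fig. 3) and locates where the drift period equals 24 h. It assumes the relativistic invariant $p^2 L^3$ = const for equatorially mirroring particles and the textbook dipole drift frequency in SI units; the function names and the bisection bracket are illustrative assumptions.

```python
import math

B0 = 3.11e-5      # T, equatorial surface field quoted in the Methods
RE = 6.4e6        # m, Earth's radius quoted in the Methods
MC2_EV = 511.0e3  # eV, electron rest energy (standard constant)
T0_HOURS = 24.0   # period of the diurnal perturbation


def state_on_mu_surface(L, K_ref_eV=30.0e3, L_ref=2.0):
    """Return (K in eV, gamma) at L on the surface p^2 L^3 = const through (K_ref, L_ref)."""
    p2c2_ref = K_ref_eV * (K_ref_eV + 2.0 * MC2_EV)  # (pc)^2 in eV^2
    p2c2 = p2c2_ref * (L_ref / L) ** 3
    gamma = math.sqrt(1.0 + p2c2 / MC2_EV ** 2)
    return (gamma - 1.0) * MC2_EV, gamma


def drift_period_hours(L):
    """Drift period (hours) at L for a particle constrained to the surface above."""
    _, gamma = state_on_mu_surface(L)
    k_eff = (gamma ** 2 - 1.0) * MC2_EV / (2.0 * gamma)
    omega = 3.0 * L * k_eff / (B0 * RE ** 2)
    return 2.0 * math.pi / omega / 3600.0


def resonant_L(lo=1.0, hi=6.0, tol=1e-6):
    """Bisection for the L where the drift period is 24 h (the period grows with L)."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if drift_period_hours(mid) < T0_HOURS:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    L_res = resonant_L()
    K_res, _ = state_on_mu_surface(L_res)
    print(f"resonance near L = {L_res:.2f}, K = {K_res / 1e3:.1f} keV")
```

Under these assumptions the resonance on this surface falls at roughly L ≈ 2.8 and K ≈ 11 keV, so particles at smaller L (higher energy) on the same surface drift faster than the perturbation and are non-resonant.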

The folds in the electron distribution function produced by the phase 
mixing process are manifested as zebra stripes in the energy spectra. 
The spacing between the phase space folds changes with time, which 
can cause variations in the zebra patterns observed during quiet-time 
conditions (Fig. 1a-c, Extended Data Fig. 3). Stretching and folding
leads to a continuous increase in the density of stripes in the zebra pat- 
terns. If the process could continue uninterrupted, a zebra pattern mon- 
itored with a detector of finite energy resolution would eventually merge 
into a seemingly continuous spectrum. The ubiquitous presence of the 
zebra patterns at all analysed spacecraft orbits suggests that the stretch- 
ing and folding process is occasionally disrupted by additional perturba- 
tions, which reset energetic electron distributions to new initial conditions. 

During geomagnetic storms, the co-rotation region and the inner belt 
are affected by additional perturbations driven by enhanced solar-wind 
activity. To understand how that may affect the zebra patterns, we con- 
sidered one possible source of such perturbations, namely, magneto- 
spheric substorms. Substorms involve global reconfigurations of Earth’s 
magnetotail leading to disturbances that can propagate deep into the 
inner magnetosphere. We modified our test-particle simulations by
adding a perturbation electric field based on a statistical distribution of
substorm recurrence frequency. The model results (Fig. 1h and Supplementary
Videos 3 and 4) are qualitatively similar to energetic electron
distributions observed during geomagnetic storms (Fig. 1e-g).
We therefore conclude that although patterns of zebra stripes in Earth’s 
trapped electrons are observed at any level of solar-wind activity, enhanced 
activity can substantially modify and even disrupt their structure. Addi- 
tional work is in progress to understand the potential role of ULF waves 
during enhanced geomagnetic activity in the modification of zebra stripes. 


METHODS SUMMARY 


To understand how zebra patterns can be produced by global diurnal oscillations 
of the inductive electric field associated with Earth’s rotation, we used a test-particle 
model that computes the guiding centre motion of equatorially mirroring electrons 
in Earth’s dipole magnetic field. Evolution of an initially smooth distribution of 
the electron phase space density, f(L, K) = A(l-e ““)K_?™, where A is a normal- 
ization constant and the parameters % = 5 and f(L) = — 1.75 — 0.80(L — 1)° were 
chosen to represent average observed electron distributions that we estimated for 
the region between L = 1 and L = 3 and K = 25 keV and K = 250 keV, was com- 
puted with the use of 1.3 X 10° test particles. The dominant effect of diurnal oscil- 
lations in the field was simulated with an azimuthal component of the inductive 
electric field $E_\varphi = E_0 \sin(\varphi - \omega_0 t + \psi_0)$, where the amplitude $E_0$ = 0.2 mV m$^{-1}$,
$\varphi$ is the azimuthal drift angle (longitude), the frequency $\omega_0 = 2\pi/T_0$, the period
$T_0$ = 24 h, and $\psi_0$ is a random phase shift. The effect of storm-time magnetospheric
substorms was simulated by adding a perturbation electric field
$E_{\varphi 1} = E_1 \sum_{i=1}^{5} \sin(\varphi - \omega_i t + \psi_i)$, where $E_1$ = 0.32 mV m$^{-1}$, $\omega_i = 2\pi/T_i$ and $T_i$ = 2-4 h,
corresponding to the most probable recurrence of magnetospheric substorms.


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
Figure 3 | Formation of zebra stripes. a, The
phase space portrait of electron motion at a
constant value of the first adiabatic invariant
($K$ = 30 keV at $L$ = 2) in the frame moving at the
perturbation phase velocity, $\theta$ (= $\varphi - \omega_0 t$). The
separatrix (red) divides resonant and non-resonant 
trajectories. The bold segments show the intervals 
traversed over a fixed time interval at different 
initial conditions. Particles (filled pink circles with 
arrows) initially distributed over a line segment 
(purple) eventually spread over the phase space 
region (blue). b, A snapshot of the zebra pattern 
produced from an initially smooth distribution. 
The phase space density value (f) is shown with 
colour. 
METHODS


Energetic electrons trapped in the approximately dipolar magnetic field in the inner 
belt exhibit quasiperiodic motion, which consists of three components: rapid gyro- 
motion around the guiding centre, the bounce motion of particle guiding centre 
along the field lines between conjugate reflection points, and slow gradient-curvature 
longitudinal drift of the guiding centre around Earth. Each component of the electron
motion is associated with an adiabatic invariant.

Electron energy along the individual stripes of zebra patterns observed by RBSPICE 
varies as 1/L, which means that the stripes trace constant values of the gradient- 
curvature drift frequency. This, along with the large spatial extent of the stripes, 
suggests that their formation involves a global mechanism that interacts with the 
electron gradient-curvature drift motion associated with the third adiabatic invari- 
ant. An example of zebra stripes is shown in Extended Data Fig. 1. The lower panel 
shows the energy spectrum of energetic electrons with approximately 90° pitch 
angles along a perigee pass of the Van Allen Probes A. The pitch-angle distribution 
of ~230 keV electrons along the pattern is shown in the upper panel of the figure. 
Electrons have a highly anisotropic pitch-angle distribution centred at 90°; a fit with 
the canonical $\sin^n\alpha$ distribution (where $\alpha$ is the pitch angle) yields $n$ = 4. Zebra
stripes extend across the entire pitch-angle distribution and are curved towards 
larger L values for more field-aligned particles. The curved shape of the stripes reflects 
the pitch-angle and L dependence of the bounce-averaged gradient-curvature 
drift frequency, which further points to the fact that the zebra patterns are formed
in a global process that modifies the electron drift motion. 

To understand how zebra patterns can be produced by global diurnal oscilla- 
tions of the inductive electric field associated with Earth’s rotation, we used a test- 
particle model. Because the spatial and temporal scales of the gradient-curvature 
drift motion of energetic electrons in the inner belt are separated from the spatial 
and temporal scales of their gyro- and bounce motions by orders of magnitude,
the formation of zebra stripes does not violate either the first or the second adiabatic
invariants of the electron motion. From the conservation of the first invariant
it follows that the electron motion can be described in the guiding centre
approximation. Because the stripes are observed at all values of the narrow
pitch-angle distribution of trapped particles, the mechanism can be illustrated for 
a fixed value of the second invariant. We therefore restrict our consideration to the 
equatorially mirroring particles for which the pitch-angle distribution peaks. 

In a dipole magnetic field, the guiding centre drift due to the magnetic field gradient
and curvature is in the azimuthal direction. A small-amplitude (that is, $E \ll B$)
electric field causes an additional drift, $\mathbf{u}_E \sim \mathbf{E} \times \mathbf{B}$, which drives particle motion in
the radial direction but does not considerably affect the azimuthal motion domi- 
nated by the gradient drift. In the presence of monochromatic oscillations in the 
electric field produced by Earth’s rotation, the guiding centre motion of equat- 
orially mirroring particles is defined by: 


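$$\dot{L} = \frac{c E_0 L^3}{B_0 R_E} \sin(\varphi - \omega_0 t + \psi), \qquad \dot{\varphi} = \omega_D(L) = \frac{3 \mu c}{\gamma e} \frac{1}{(R_E L)^2} \qquad (1)$$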


where $E_0$ is the electric field amplitude, $\omega_0 = 2\pi/T_0$ is the electric field frequency,
$T_0$ = 24 h, $\varphi$ is the azimuthal angle, $\psi$ is a phase shift, $\omega_D$ is the frequency of the
gradient and curvature drift motion, $\mu$ is the first adiabatic invariant, $\gamma$ is the
relativistic factor, $c$ is the speed of light, $B_0 = 3.11 \times 10^4$ nT is the magnetic field intensity
on Earth’s surface at the Equator, and $R_E = 6.4 \times 10^3$ km is Earth’s radius.
Equations (1) were solved numerically with $E_0$ = 0.2 mV m$^{-1}$ for an ensemble
of 1.3 X 10° test particles initially evenly distributed between L = 1 and L = 3 with 
energies between K = 25 keV and K = 250keV. Global evolution of the energetic 
electron distributions across the inner belt was computed using Liouville’s the- 
orem: the values of electron phase density were traced along the calculated test- 
particle trajectories backwards in time to the initially smooth distribution: 


f(L,K) =A —e*) KY) (2) 


where the parameters « = 5 and $(L) = —1.75—0.80(L — 1)” were chosen to fit the 
averaged distribution of energetic electrons measured by RBSPICE A. Numerical 
simulations (Fig. 1d and Supplementary Video 1) showed the emergence of zebra 
patterns that are remarkably similar to the patterns observed by RBSPICE A during 
quiet-time conditions (Fig. 1a-c).
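
The kind of test-particle integration described above can be sketched for a single particle as follows. This is a minimal illustration, not the authors' code: it assumes an SI form of the guiding-centre equations (radial E × B drift $\dot{L} = E_0 L^3 \sin(\varphi - \omega_0 t + \psi)/(B_0 R_E)$ and the textbook dipole drift frequency for $\dot{\varphi}$), a particular sign convention, conservation of $p^2 L^3$, and a fixed-step fourth-order Runge-Kutta scheme. The function names, step size and the 70 keV starting energy are hypothetical.

```python
import math

B0 = 3.11e-5                              # T, equatorial surface field (Methods)
RE = 6.4e6                                # m, Earth's radius (Methods)
MC2_EV = 511.0e3                          # eV, electron rest energy
OMEGA0 = 2.0 * math.pi / (24.0 * 3600.0)  # rad/s, diurnal frequency


def rhs(t, L, phi, E0, psi, p2c2_L3):
    """Return (dL/dt, dphi/dt) for one particle with p^2 L^3 conserved."""
    p2c2 = p2c2_L3 / L ** 3
    gamma = math.sqrt(1.0 + p2c2 / MC2_EV ** 2)
    # gradient-curvature drift (assumed textbook dipole form)
    dphi = 3.0 * L * p2c2 / (2.0 * gamma * MC2_EV * B0 * RE ** 2)
    # radial E x B drift driven by the azimuthal electric field (assumed sign)
    dL = E0 * L ** 3 / (B0 * RE) * math.sin(phi - OMEGA0 * t + psi)
    return dL, dphi


def integrate(L0, K0_eV, phi0=0.0, E0=0.2e-3, psi=0.0, days=1.0, dt=60.0):
    """Fixed-step RK4 integration; returns the list of L values along the trajectory."""
    p2c2_L3 = K0_eV * (K0_eV + 2.0 * MC2_EV) * L0 ** 3
    t, L, phi = 0.0, L0, phi0
    history = [L]
    for _ in range(int(round(days * 86400.0 / dt))):
        k1 = rhs(t, L, phi, E0, psi, p2c2_L3)
        k2 = rhs(t + dt / 2, L + dt / 2 * k1[0], phi + dt / 2 * k1[1], E0, psi, p2c2_L3)
        k3 = rhs(t + dt / 2, L + dt / 2 * k2[0], phi + dt / 2 * k2[1], E0, psi, p2c2_L3)
        k4 = rhs(t + dt, L + dt * k3[0], phi + dt * k3[1], E0, psi, p2c2_L3)
        L += dt / 6.0 * (k1[0] + 2.0 * k2[0] + 2.0 * k3[0] + k4[0])
        phi += dt / 6.0 * (k1[1] + 2.0 * k2[1] + 2.0 * k3[1] + k4[1])
        t += dt
        history.append(L)
    return history


if __name__ == "__main__":
    Ls = integrate(1.5, 70.0e3)  # a non-resonant particle (drift period of a few hours)
    print(f"L ranges over [{min(Ls):.3f}, {max(Ls):.3f}] in one day")
```

For this non-resonant starting point the radial position only oscillates by a few hundredths of an Earth radius, whereas setting `E0=0.0` leaves L exactly constant. Building the full spectra would require an ensemble of such trajectories and the Liouville mapping described in the text.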

The mechanism of zebra stripe formation is illustrated in Fig. 3, Supplementary 
Video 2 and Extended Data Fig. 2. Figure 3a shows the phase portrait of equations 
(1) along the surface of a constant value of the first adiabatic invariant: $L \propto K^{-1/3}$
(K = 30keV at L = 2). The system has two different types of solution. Resonant 
particles with a drift period of ~24h are trapped in the potential well formed by 
diurnal oscillations in the electric field. Their azimuthal angle and energy oscillate 
about the resonance values forming a phase space island bounded by a separatrix. 
The frequency of these nonlinear oscillations depends on particle proximity to the
resonance, which causes the phase mixing: particles with similar initial conditions
gradually spread over the resonant island. The emergence of complex multi-peak
distributions during phase mixing of trapped particle populations has been established
in multiple physical systems. Most of the energy range of zebra stripes,
however, corresponds to non-resonant particles whose phase space trajectories are
outside the separatrix. Formation of regular phase-space patterns by non-resonant
particles is attributed to the periodic character of their azimuthal drift around Earth,
which makes them effectively trapped. Over the course of their azimuthal drift motion,
non-resonant particles flow around the resonance island repeatedly at frequencies 
that depend on their proximity to the separatrix, which causes stretching and folding 
of their distributions, leading to the formation of stripes. 

The process is schematically illustrated by Extended Data Fig. 2. Consider the evolution
of a group of non-resonant particles distributed in azimuthal angle but with
the same energy and radial position taken from a smooth distribution function that 
initially is independent of the azimuthal angle (panels a, c). All particles of the group 
therefore have the same value of phase space density. Particle motion around the 
resonant island will stretch the distribution and eventually create a fold. Panel d 
corresponds to the moment when points A, B and C are aligned along the azi- 
muthal angle where a spacecraft observation is taken. Since the phase space den- 
sity is conserved along particle trajectories, points A, B and C of the measured 
energy spectrum correspond to the same value of the phase space density. The 
energy spectrum among these three points, however, is produced by particles with 
different initial energies and phase space density values. Consequently, the A, B, C 
fold in the particle distribution results in a local peak in the measured profile of the 
electron phase space density (panel b). Multiple folds will create multiple local 
peaks in the phase space density, which constitute zebra patterns observed by the 
spacecraft. 

As can be seen from Supplementary Video 1, with the model parameters chosen 
for quiet-time geomagnetic conditions it takes about 1 day for a distinct zebra pat- 
tern to emerge from an initially smooth electron distribution. Long-term evolution 
of a zebra pattern under steady-state conditions is illustrated in Extended Data 
Fig. 3, which shows electron intensity ($j = p^2 f$) as a function of energy along the
surface of a constant first adiabatic invariant from the simulation illustrated in 
Fig. 3 and Supplementary Video 2. Stretching and folding leads to a continuous 
increase of the number of stripes associated with the decrease in their width and 
spacing. This can explain variations in the zebra patterns observed by RBSPICE A 
(top panels of Fig. 1). 

If the stretching and folding process could continue uninterrupted, a zebra pat- 
tern monitored with a finite energy resolution detector would eventually merge into 
a seemingly continuous spectrum. However, in the 6-month interval between March 
and August 2013 zebra patterns were observed at all perigee passes. This can be 
explained by a disruption of the stretching and folding process by additional per- 
turbations of the co-rotation region which reset energetic electron distributions to 
new initial conditions. This explanation is supported by the fact that the zebra 
patterns are much more variable during intervals of enhanced geomagnetic activ- 
ity, when the amplitude of the solar-wind-driven perturbations penetrating inside 
the co-rotation region increases. 

To examine how additional disturbances can modify the zebra patterns, we con- 
sidered one possible source of such disturbances: magnetospheric substorms, which 
produce global perturbations of the fields that can penetrate deep into the inner 
magnetosphere. The following electric field term was added to the right-hand side 
of the first equation in system (1): 


$$E_{\varphi 1} = E_1 \sum_{i=1}^{5} \sin(\varphi - \omega_i t + \psi_i) \qquad (3)$$


where $\omega_i = 2\pi/T_i$, and $T_i$ were evenly distributed between 2 and 4 h corresponding
to the statistically determined range of the substorm recurrence, and the phase
shifts $\psi_i$ were randomly selected between 0 and $2\pi$. According to statistical analysis,
the inductive electric fields associated with substorm dipolarizations are of
the order of 5 mV m$^{-1}$ at $L$ = 4.2 (ref. 28). Although there are no direct measurements
of substorm-associated electric field at the inner-belt altitudes, it can be
expected that the perturbations have become much weaker by the time they reach
the inner belt. The amplitude $E_1$ = 0.32 mV m$^{-1}$ was therefore chosen such that
the root-mean-square amplitude of the perturbation is 0.5 mV m$^{-1}$, that is, an
order of magnitude less than the characteristic amplitudes observed at higher 
L-shells. The model results (Fig. 1h and Supplementary Video 3) are qualitatively 
similar to energetic electron distributions observed during geomagnetic storms. 
Supplementary Video 4 shows the results from a simulation with a slightly mod- 
ified perturbation electric field, where the upper limit of the substorm periodicity 
in equation (3) was increased from 4 to 6 h. Considerable differences in zebra pat- 
terns between this and the previous simulations show that the structure of the zebra 
patterns is very sensitive to details of perturbations in the field, which is in agreement
with the large diversity of zebra patterns observed during disturbed geomagnetic conditions
(Fig. 1e-g).

26. Cary, J. R. & Brizard, A. J. Hamiltonian theory of guiding-center motion. Rev. Mod. Phys. 81, 693-738 (2009).

©2014 Macmillan Publishers Limited. All rights reserved
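
The relation between the per-mode amplitude of 0.32 mV m$^{-1}$ and the quoted root-mean-square amplitude of 0.5 mV m$^{-1}$ for the substorm perturbation in the Methods can be checked numerically. The sketch below assumes five sinusoids of equal amplitude with periods spaced evenly at 2, 2.5, 3, 3.5 and 4 h (one reading of "evenly distributed") and arbitrary fixed phases; for such a sum the time-averaged RMS is $E_1\sqrt{5/2}$. All names and the specific phase values are illustrative.

```python
import math

E1 = 0.32                               # mV/m, per-mode amplitude from the text
PERIODS_H = [2.0, 2.5, 3.0, 3.5, 4.0]   # hours; assumed even spacing between 2 and 4 h
PHASES = [0.3, 1.1, 2.0, 4.2, 5.5]      # rad; arbitrary illustrative phases


def perturbation(t_h, phi=0.0):
    """Sum of five equal-amplitude modes at time t_h (hours) and azimuth phi."""
    return E1 * sum(
        math.sin(phi - 2.0 * math.pi / T * t_h + psi)
        for T, psi in zip(PERIODS_H, PHASES)
    )


def rms_over(span_h, n):
    """Root-mean-square of the perturbation sampled n times over span_h hours."""
    acc = 0.0
    for k in range(n):
        e = perturbation(span_h * k / n)
        acc += e * e
    return math.sqrt(acc / n)


analytic_rms = E1 * math.sqrt(len(PERIODS_H) / 2.0)
# 420 h is a common multiple of all five periods, so cross terms average out
numeric_rms = rms_over(420.0, 100000)

if __name__ == "__main__":
    print(f"analytic RMS = {analytic_rms:.3f} mV/m, numeric RMS = {numeric_rms:.3f} mV/m")
```

Both estimates come out close to 0.5 mV m$^{-1}$, consistent with the value stated in the text.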
Extended Data Figure 1 | A zebra stripe pattern from the RBSPICE A
perigee pass on 30 March 2013. Differential intensity (shown with colour) of
energetic electrons as function of time along a perigee pass through the inner
radiation belt (radial distance L in Earth radii is marked on the top). Bottom
panel, the energy dependence (in keV) of differential intensity for electrons
with approximately 90° pitch angles. Upper panel, the pitch-angle distribution
of the intensity of 230 keV electrons. It shows highly pronounced stripes that
are curved towards more field-aligned particles. The ringed numbers plotted
along the 230 keV line in the bottom panel and along the 90° values in the upper
panel show the correspondence of stripes in the energy and pitch-angle spectra.
Extended Data Figure 2 | Formation of zebra stripes in the process of
stretching and folding of the electron phase space density distribution.
a, c, Smooth initial phase space density independent of the drift angle, $f(K)$
(a); points A, B and C with different drift angle ($\theta$) values but the same energy $K$
(radial position) correspond to the same value of the phase space density (c).
b, d, The moment in the stretching and folding process when points A, B and C
are aligned along the azimuthal angle where a spacecraft observation is taken
(d). Points A, B and C are separated in energy at the same level of the phase
space density, which requires formation of a local peak in phase space density
(b) corresponding to a stripe of the emerging zebra pattern.
Extended Data Figure 3 | Long-term evolution of a zebra pattern under
steady-state conditions. Four panels show electron intensity ($j = p^2 f$) as a
function of energy along the surface of a constant adiabatic invariant (same as
Fig. 3 and Supplementary Video 2) at different moments (top to bottom, time 
T = 1, 2, 3 and 4 days) of a long-term simulation of electron motion in the 
presence of global diurnal oscillations in the electric field. Stretching and 
folding leads to a continuous increase of the number of stripes associated with 
the decrease in their width and spacing. 
A fully photonics-based coherent radar system

Paolo Ghelfi, Francesco Laghezza, Filippo Scotti, Giovanni Serafino, Amerigo Capria, Sergio Pinna, Daniel Onori,
Claudio Porzi, Mirco Scaffardi, Antonio Malacarne, Valeria Vercesi, Emma Lazzeri, Fabrizio Berizzi & Antonella Bogoni


The next generation of radar (radio detection and ranging) systems 
needs to be based on software-defined radio to adapt to variable envi- 
ronments, with higher carrier frequencies for smaller antennas and 
broadened bandwidth for increased resolution. Today’s digital
microwave components (synthesizers and analogue-to-digital converters)
suffer from limited bandwidth with high noise at increasing
frequencies, so that fully digital radar systems can work up to only
a few gigahertz, and noisy analogue up- and downconversions are
necessary for higher frequencies. In contrast, photonics provide high
precision and ultrawide bandwidth, allowing both the flexible generation
of extremely stable radio-frequency signals with arbitrary
waveforms up to millimetre waves, and the detection of such signals
and their precise direct digitization without downconversion.
Until now, the photonics-based generation and detection of radio-frequency
signals have been studied separately and have not been
tested in a radar system. Here we present the development and the
field trial results of a fully photonics-based coherent radar demonstrator
carried out within the project PHODIR. The proposed architecture
exploits a single pulsed laser for generating tunable radar
signals and receiving their echoes, avoiding radio-frequency up- and 
downconversion and guaranteeing both the software-defined approach 
and high resolution. Its performance exceeds state-of-the-art elec- 
tronics at carrier frequencies above two gigahertz, and the detection 
of non-cooperating aeroplanes confirms the effectiveness and expected 
precision of the system. 

The performance of the transceiver we developed is summarized 
in Table 1, together with a picture of the project demonstrator (Fig. 1). 
The transceiver enables radar systems with unprecedented frequency 
flexibility, signal quality and receiver resolution at high carrier frequency. 
The radar demonstrator has also been tested in a real environment. 

Table 1 | Performance of the photonics-based radar transceiver

The photonic generation of radio-frequency (RF) signals, avoiding
microwave oscillators, can be realized either by means of opto-electronic
oscillators or by the heterodyning of two lasers in a photodiode.
The opto-electronic oscillators show excellent stability up to 40 GHz
with a miniaturized footprint, but their tunability is limited to a few
gigahertz, whereas the heterodyning can generate any frequency up to
the photodiode bandwidth, and is therefore more promising. Nevertheless,
it requires the two lasers to be phase-locked. This can easily be
obtained by selecting any pair of modes from the spectrum of mode-locked
lasers (MLLs), thanks to their intrinsic phase-locking condition.
Moreover, selecting laser modes with variable wavelength detuning
allows the flexible production of RF carriers with tunable frequency.
Finally, the heterodyne method allows us to add an arbitrary modulation
on the generated RF signal by modulating one of the beating laser modes.

The receivers for radar systems based on software-defined radio need 
analogue-to-digital converters (ADCs) that accept input signals over 
several tens of gigahertz, with fast sampling rates suitable for broadband 
signals and a high signal-to-noise ratio. Electronic ADCs are limited by 
the aperture jitter of the electronic sampling process—that is, the fluc- 
tuations of the interval duration between two consecutive samples— 
which spoils the accuracy of the digitized signal by adding a noise floor. 
Optical sampling can overcome this limitation: a train of short optical
pulses can be directly modulated by the RF signal to be digitized, and 
then the peak amplitude of each optical pulse can be detected and digi- 
tized. Thus, sampling rates of several gigahertz are achievable, and the 
timing jitter of the sampling pulses can be much less than 100 fs. MLLs 
are particularly suitable pulse sources, with narrow pulsewidths and low 
temporal jitter. 
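
As an illustrative aside, the noise floor that sampling jitter adds can be estimated with the standard expression for an ideal sampler, SNR = −20 log10(2π f σ), together with the usual conversion ENOB = (SNR − 1.76)/6.02. The sketch below is only a bound under stated assumptions: the function names are hypothetical, and thermal noise, quantization and spurious tones are ignored, so it does not predict the performance of any specific converter.

```python
import math

def jitter_limited_snr_db(f_in_hz: float, jitter_rms_s: float) -> float:
    """Upper bound on SNR (dB) for a sine wave at f_in_hz sampled with RMS jitter."""
    return -20.0 * math.log10(2.0 * math.pi * f_in_hz * jitter_rms_s)

def enob_from_snr(snr_db: float) -> float:
    """Effective number of bits corresponding to a given SNR (dB)."""
    return (snr_db - 1.76) / 6.02

# Compare two RMS jitter values at a 40 GHz input (assumed example values).
for jitter_fs in (100.0, 10.0):
    snr = jitter_limited_snr_db(40e9, jitter_fs * 1e-15)
    print(f"jitter {jitter_fs:5.1f} fs -> SNR bound {snr:5.1f} dB, "
          f"ENOB bound {enob_from_snr(snr):4.2f} bits")
```

Under these assumptions, a tenfold reduction in jitter raises the bound by 20 dB, that is, from roughly 5 to roughly 8.3 effective bits at 40 GHz.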

Parameter | Photonics-based transceiver | State-of-the-art electronics transceiver
Transmitter
Carrier frequency | Flexible direct generation up to 40 GHz | Direct generation below 2 GHz; up-conversions above 2 GHz
Signal jitter | <15 fs, integrated from 10 kHz to 10 MHz | Typical >20 fs, integrated from 10 kHz to 10 MHz
Signal-to-noise ratio | >73 dB MHz⁻¹ | >80 dB MHz⁻¹
Spurious-free dynamic range | >70 dBc | >70 dBc
Instantaneous bandwidth | 200 MHz, easily extendable with MLL at higher repetition rate | <2 GHz
Receiver
Input carrier frequency | Up to 40 GHz, with direct RF undersampling | <2 GHz; down-conversions at higher frequencies
Instantaneous bandwidth | 200 MHz, easily extendable with MLL at higher repetition rate | <2 GHz
Sampling jitter | <10 fs, integrated from 10 kHz to 10 MHz | Typical >100 fs, integrated from 10 kHz to 10 MHz
Spurious-free dynamic range | 50 dB | >70 dB
Effective number of bits | >7 for carrier frequency up to 40 GHz | <8 for carrier frequency <2 GHz

Test results of the photonics-based transceiver are shown versus state-of-the-art electronic radar transceivers.

Figure 1 | The photonics-based radar transceiver. A picture (taken by A.B.)
of the field-trial demonstrator. DSP, digital signal processor.

The basic idea of the PHODIR project (‘Photonics-based fully digital
radar’) is to use a single MLL for the realization of both the RF generation
and sampling, obtaining an entire optics-based transceiver, as sketched
in Fig. 2 (left), reducing the impact of photonics on the system costs. In
the band-pass sampling process, which requires the sampling frequency
to be at least twice the bandwidth of the pass-band signal, the carrier
frequency of the sampled signal must not be a multiple of the sampling
frequency. Unfortunately, this is the case when the same MLL is used
for both generating and sampling the radar signal. This problem can be
fixed by shifting one of the two MLL modes before heterodyning. The
frequency shift must be driven by an RF oscillator with a negligible phase
noise compared to the MLL, which is true if low-frequency oscillators
are used. Figure 2 (right) shows details of the PHODIR transceiver.
In the photonics-based RF generator, two tunable optical filters select
two modes from the spectrum of the MLL. One is modulated by the
baseband radar pulse in a Mach-Zehnder modulator (MZM), whereas
the other is shifted in frequency. In the photonics-based ADC, the optical
pulses from the MLL are modulated in a MZM driven by the received
echo signal. The modulated pulses are then parallelized in a serial-to-parallel
converter into multiple lower-rate sample streams so that, after
opto-electronic conversion, multiple electronic ADCs with low sampling
frequency and high precision can be used to digitize the pulse peak
power carrying the sample information. The parallel digital samples are
then interleaved in the digital signal processor to reconstruct the original
signal. To match the pulse width to the photodiode bandwidth,
spools of dispersion compensating fibre are inserted to broaden the
optical pulses.
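
The band-pass sampling constraint described above can be checked numerically. The following sketch is not part of the PHODIR implementation; the helper names are hypothetical, and it assumes the values used later in the laboratory tests (a 400 MHz repetition rate and a 188 MHz shift applied below the hundredth harmonic). It computes the heterodyne beat N·Δν − δν and the frequency at which a carrier appears after sampling at the MLL repetition rate.

```python
def heterodyne_carrier_mhz(n: int, rep_rate_mhz: int, shift_mhz: int = 0) -> int:
    """Beat note of two MLL modes N lines apart, with one mode shifted down by shift_mhz.

    The minus sign is an assumption matching 39,812 MHz = 100 * 400 MHz - 188 MHz.
    """
    return n * rep_rate_mhz - shift_mhz

def aliased_frequency_mhz(f_mhz: int, fs_mhz: int) -> int:
    """Frequency in the first Nyquist zone (0 .. fs/2) after sampling at fs."""
    r = f_mhz % fs_mhz
    return min(r, fs_mhz - r)

REP_RATE_MHZ = 400  # MLL repetition rate, also the optical sampling rate

unshifted = heterodyne_carrier_mhz(100, REP_RATE_MHZ)
shifted = heterodyne_carrier_mhz(100, REP_RATE_MHZ, shift_mhz=188)

# Without the shift the carrier is an exact multiple of the sampling rate
# and folds onto direct current; with the shift it lands inside 0..200 MHz.
print(unshifted, "MHz ->", aliased_frequency_mhz(unshifted, REP_RATE_MHZ), "MHz")
print(shifted, "MHz ->", aliased_frequency_mhz(shifted, REP_RATE_MHZ), "MHz")
```

The same folding rule applies to any input carrier, which is why a single 400 MHz sampling clock can digitize signals across the whole range up to the modulator bandwidth.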

The precision of photonics is particularly valuable in coherent radar 
systems, which derive the speed of the targets from the Doppler fre- 
quency shift obtained by comparing the instantaneous frequency of the 
echoed signal with that of a continuous-wave reference signal. To allow 
this functionality, the PHODIR transceiver also generates the reference 
signal (see Fig. 2), exploiting a similar structure used for the radar pulse 
but without the modulation. Orthogonal sampling is implemented to 
detect the in-phase and quadrature components of the reference signal, 
allowing us to sample the reference signal only once and to derive the 
quadrature samples by shifting the in-phase samples.

Figure 2 | Architecture of the photonics-based radar. Basic structure and
detailed scheme of principle of the photonics-based coherent radar. S/P,
series-to-parallel converter; DCF, dispersion compensating fibre; t, time;
ν, optical frequency; Δν, MLL repetition frequency; δν, applied frequency shift;
f, RF frequency; N, positive integer.

Figure 3 | Test results of the photonics-based RF generator. a, Phase noise
curve of the RF signal optically generated at 39,812 MHz, compared with the
phase noise curves of the fundamental frequency of the laser at 400 MHz, of its
hundredth multiple at 40,000 MHz, of the continuous-wave intermediate
frequency at 188 MHz generated by a direct digital synthesizer, and of a signal at
39,812 MHz generated by a synthesizer. b, Amplitude and phase transients of
a 1-μs radar pulse phase-modulated by a 13-bit Barker code with carrier
frequency of 39,812 MHz. c, RF spectrum of the 13-bit Barker coded radar pulse
at 39,812 MHz, compared with the RF spectrum of the uncoded pulse.

Figures 3 and 4 show the results obtained when testing the transceiver
in the laboratory. The MLL we used generates subpicosecond pulses
with a repetition rate of 400 MHz. The transmitter that we used differs
slightly from the scheme in Fig. 2 in that the laser modes are not selected
by tunable filters. Instead, the spectrum of the MLL is modulated by the
radar pulse at an intermediate frequency equal to the frequency shift,
and then photodetected. At the photodiode there are several pairs of
modes that beat together at different frequencies, including the desired
one. Given that all the beatings at the same frequency have the same
phase, at the photodiode output a clear signal can be selected by an RF
filter at the desired frequency. Therefore, here the tunability is moved
into the RF domain by means of selectable RF filters. In any case, this does
not affect the photonics-based approach because the stability and
flexibility of the generated RF signals are only obtainable using photonics.

Figure 4 | … optically sampled at 400 × 10⁶ samples per second and digitized at
4 × 100 × 10⁶ samples per second. c, Extended Walden plot, comparing the
performance of electronic and photonic ADCs.

We tested the photonics-based RF generator, producing signals with
carriers up to 40 GHz (limited by the photodiode bandwidth). As an
example, Fig. 3 reports the results obtained at 39,812 MHz. Figure 3a
reports the stability, in terms of the phase noise curve, of the optically
generated RF carrier, which is obtained by applying a shift of 188 MHz to
the hundredth multiple of the MLL fundamental frequency. The graph
also reports the phase noise of the detected fundamental frequency of
the laser at 400 MHz, of its hundredth multiple at 40,000 MHz, and of
the continuous-wave intermediate frequency at 188 MHz generated by
a direct digital synthesizer. As expected, the phase noise curve of the
40,000 MHz signal is equivalent to the fundamental phase noise curve
of the laser shifted up by 40 dB. The signal at intermediate frequency
shows significantly lower phase noise than that at 40,000 MHz, so the
signal at 39,812 MHz substantially maintains the noise characteristics
of the non-shifted signal. The higher noise floor at high offset frequency
is induced by the necessary RF amplifiers, and does not substantially
affect the signal stability. Given that the typical radar pulse repetition
time is about 100 μs, the phase noise curve is of interest for offset
frequencies from 10 kHz and up. The timing jitter can be calculated by
integrating the phase noise over this range, which turns out to be 10 fs
for the fundamental 400 MHz and its multiple at 40,000 MHz, and 15 fs
for the shifted signal. For comparison, Fig. 3a also reports the phase
noise curve of a signal at 39,812 MHz generated by a state-of-the-art
synthesizer, which shows a jitter of 23 fs. Our scheme therefore performs
significantly better than the synthesizer. A spectrum analysis of the
photonics-generated carrier also shows a signal-to-noise ratio higher
than 73 dB on a noise bandwidth of 1 MHz, and a spurious-free dynamic
range of >70 dBc (decibels relative to the carrier power). Analogous
results have been obtained generating signals in the range from a few
hertz to 40 GHz by substituting the RF filter after the photodiode with
others at a different central frequency, and setting the direct digital
synthesizer to generate a different intermediate frequency. This
demonstrates that the proposed method can effectively generate very stable
RF signals at arbitrary frequencies over an unprecedented broad range.
Performance improvements can be obtained by exploiting photonic
integration techniques.
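
The link between a phase noise curve and timing jitter can be sketched as follows. This is a minimal illustration, not the analysis used for Fig. 3: the function name is hypothetical, and the flat −165 dBc/Hz spectrum is a made-up input chosen only because it integrates to about 10 fs at 400 MHz. The code integrates the single-sideband phase noise L(f) between 10 kHz and 10 MHz and converts the RMS phase error to time, and it shows why multiplying the carrier by 100 raises the phase noise by 20 log10(100) = 40 dB while leaving the jitter unchanged.

```python
import math

def rms_jitter_s(offsets_hz, phase_noise_dbc_hz, carrier_hz):
    """RMS timing jitter from single-sideband phase noise L(f), trapezoid rule."""
    area = 0.0
    for i in range(len(offsets_hz) - 1):
        l0 = 10.0 ** (phase_noise_dbc_hz[i] / 10.0)
        l1 = 10.0 ** (phase_noise_dbc_hz[i + 1] / 10.0)
        area += 0.5 * (l0 + l1) * (offsets_hz[i + 1] - offsets_hz[i])
    phase_rms_rad = math.sqrt(2.0 * area)  # factor 2 accounts for both sidebands
    return phase_rms_rad / (2.0 * math.pi * carrier_hz)

# Hypothetical flat phase noise, NOT the measured curve of the laser.
offsets = [1e4, 1e5, 1e6, 1e7]
fundamental = [-165.0] * len(offsets)

# Ideal frequency multiplication by N raises phase noise by 20*log10(N) dB.
n = 100
multiplied = [level + 20.0 * math.log10(n) for level in fundamental]

jitter_400mhz = rms_jitter_s(offsets, fundamental, 400e6)
jitter_40ghz = rms_jitter_s(offsets, multiplied, 400e6 * n)
print(f"{jitter_400mhz * 1e15:.2f} fs at 400 MHz, {jitter_40ghz * 1e15:.2f} fs at 40 GHz")
```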

By setting the direct digital synthesizer to generate modulated signals 
at intermediate frequency rather than continuous-wave signals, we can 
move the modulation signals on the desired RF carrier. Radar pulses 
can therefore be directly produced on the RF carrier by digitally gener- 
ating pulse-modulated signals at intermediate frequency. Moreover, by 
adding a phase modulation to the digital signal from the synthesizer, 
phase-coded radar pulses can be realized for increasing the radar reso- 
lution through spectral broadening. Figure 3b reports the amplitude 
and phase transients of the RF signal at 39,812 MHz that are produced 
when the intermediate frequency is modulated by 1-μs rectangular pulses
with the typical 13-bit Barker code phase modulation. Figure 3c clearly 
shows the expected pulse spectrum broadening from 1 MHz to 13 MHz, 
induced by the phase modulation. The bandwidth could be extended up 
to half the repetition rate of the MLL (200 MHz). These results confirm 
the capability of the scheme to generate arbitrary amplitude-modulated 
and/or phase-modulated signals.
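
The benefit of the 13-bit Barker code can be illustrated with its aperiodic autocorrelation. The sketch below is a generic textbook check with hypothetical variable names, not the signal processing of the PHODIR demonstrator. It shows that the compressed pulse has a peak of 13 with sidelobes of magnitude at most 1, and that dividing a 1-μs pulse into 13 chips is consistent with the spectrum broadening from 1 MHz to 13 MHz.

```python
# The 13-bit Barker code as a sequence of +1/-1 phase states (0 or pi).
BARKER_13 = [1, 1, 1, 1, 1, -1, -1, 1, 1, -1, 1, -1, 1]

def aperiodic_autocorrelation(code):
    """Autocorrelation for non-negative lags, without cyclic wrap-around."""
    n = len(code)
    return [sum(code[i] * code[i + lag] for i in range(n - lag)) for lag in range(n)]

acf = aperiodic_autocorrelation(BARKER_13)
peak = acf[0]
max_sidelobe = max(abs(value) for value in acf[1:])

pulse_duration_us = 1.0
chip_duration_us = pulse_duration_us / len(BARKER_13)
approx_bandwidth_mhz = 1.0 / chip_duration_us  # about one over the chip duration

print("autocorrelation:", acf)
print("peak:", peak, "max sidelobe:", max_sidelobe)
print("approximate coded bandwidth (MHz):", round(approx_bandwidth_mhz, 1))
```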

Figure 4 reports the results obtained by testing the photonics-based
ADC of the PHODIR system. The RF signal is optically sampled at
400 × 10⁶ samples per second in a MZM with 40-GHz bandwidth. The
modulated optical pulses are parallelized into four streams at 100 × 10⁶
samples per second, exploiting an integrated 1 × 4 LiNbO₃ switching
matrix, ensuring an extinction ratio (that is, a power ratio between a
selected output port and the three disabled ones) of >25 dB (ref. 29).
The pulses are broadened in four spools of dispersion compensating
fibre and digitized by four electronic ADCs synchronized with the pulse
peaks. Finally, a digital signal processor interleaves the samples. The
photonics-based ADC is tested with input continuous-wave signals up
to 40 GHz. As examples, Fig. 4a and b report the calculated fast Fourier
transform for signals at 9,920 MHz and 39,820 MHz, both with 4 × 10,000
points, after the digital processing. The undersampling process aliases
the signals in the Nyquist bandwidth (direct current to 200 MHz), at
80 MHz and 180 MHz, respectively. The noise floor in the two graphs
appears at different levels owing to the increasing influence of the
sampling jitter for increasing signal frequency. The residual spurious
tones after the data post-processing are several tens of decibels below
the signal, enabling a spurious-free dynamic range of >50 dB. A higher
spurious-free dynamic range could be obtained with a more precise
sample parallelization process. By integrating the fast Fourier transform it
is possible to calculate the signal-to-noise and distortion ratios, and
from this the effective number of bits. In Fig. 4c the so-called ‘Walden
plot’ is shown, reporting the performance comparison in terms of input
carrier frequency and the effective number of bits of the best electrical
and photonic ADCs reported so far, and of the PHODIR receiver. The
theoretical upper bounds caused by different sampling jitters are also
reported. The jitter of the MLL obtained by integrating its phase noise
curve from 10 kHz is about 10 fs, whereas the best electronic ADCs
show an aperture jitter of about 100 fs. As a consequence, our system
reaches an effective number of bits of 7.4 for an input signal at 9,920 MHz,
and 7 at 39,820 MHz, performing significantly better than the reported
electrical ADCs. At 40 GHz, the system performance is close to the
theoretical upper bound for 10 fs of aperture jitter, but it moves away from
the upper bound at lower signal frequencies, indicating that at lower
frequency the system’s effective number of bits is limited by the spurious
tones, whereas at higher frequency it is mostly limited by the time
jitter of the MLL, because the spurious tones become comparable with
the noise floor. Improvements in the effective number of bits could be
achieved by including a differential detection scheme to suppress the
detected noise, and an improved switching matrix with higher extinction
to reduce the power of the spurious tones. Moreover, a larger
instantaneous bandwidth could be achieved with higher sampling rates.

The test results of the PHODIR transceiver are summarized in Table 1,
and compared with the performance of state-of-the-art electronic radar
transceivers. The advantages of the photonic approach are evident in
the extreme frequency flexibility over tens of gigahertz, in the arbitrary
modulation capability, and in the precision of the digitization for any
input frequency. We believe that these features will enable the use of
software-defined radio in future radar systems.

Figure 5 | Test results of the field-trial demonstrator. a, Trajectory of a target
aeroplane. b, Range-velocity map at point A in the trajectory. c, Zoom showing
the range and velocity resolution. d, e, Range profile without (d) or with
(e) pulse coding.

We also tested the PHODIR transceiver reported above in a field-trial
demonstrator to determine its effectiveness in a radar system. For
this purpose, we used an RF front-end (RF circulator, switches,
amplifiers, filters and bistatic antenna) for a signal carrier at 9,900 MHz.
Non-cooperating (that is, normally operating and unaware of our detection)
aeroplanes were detected as dynamic targets. For example, Fig. 5a reports
the trajectory of an observed aeroplane during takeoff. The range-velocity
map of the target at point A of the trajectory is reported in Fig. 5b. The
velocity and position of the target were cross-checked by trajectory
data provided by public websites (http://www.flightradar24.com/). At
the detection point, the known radial velocity of the target is about
96 km h⁻¹ (for a real speed of about 400 km h⁻¹), and the known
distance is about 5.5 km. The corresponding detected signal shows the
target at 5.4 km with a radial velocity of 95 km h⁻¹. Figure 5c shows a
two-dimensional zoom of the range-velocity map, with distance
resolution of 150 m and velocity resolution of 2 km h⁻¹. The target shows
a detected echo 45 dB above the background. Finally, a 13-bit Barker
code was modulated on the phase of the radar pulses to increase the
resolution. Figure 5d and e report the range profile collected without and
with this pulse coding, showing an increase in precision from 150 m
to 23 m.
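
The quoted range resolution and the size of the Doppler shift involved can be cross-checked with textbook radar relations. The sketch below is illustrative only; the function names are hypothetical, and it uses the generic expressions c·τ/2 for the range resolution of an uncoded pulse and 2·v·f/c for the Doppler shift of a target with radial velocity v.

```python
SPEED_OF_LIGHT_M_S = 299_792_458.0

def range_resolution_m(pulse_duration_s: float) -> float:
    """Range resolution of an uncoded rectangular pulse."""
    return SPEED_OF_LIGHT_M_S * pulse_duration_s / 2.0

def doppler_shift_hz(radial_velocity_m_s: float, carrier_hz: float) -> float:
    """Two-way Doppler shift for a target moving radially."""
    return 2.0 * radial_velocity_m_s * carrier_hz / SPEED_OF_LIGHT_M_S

uncoded = range_resolution_m(1e-6)   # 1-microsecond pulse
ideal_coded = uncoded / 13           # ideal 13-chip compression, an upper limit
v_radial = 95.0 / 3.6                # 95 km/h expressed in m/s
shift = doppler_shift_hz(v_radial, 9.9e9)

print(f"uncoded range resolution: {uncoded:.1f} m")
print(f"ideal resolution with 13 chips: {ideal_coded:.1f} m")
print(f"Doppler shift at 9.9 GHz: {shift:.0f} Hz")
```

The uncoded value agrees with the 150 m quoted above. The ideal 13-chip figure is a best case; the 23 m measured in the field trial is coarser, and this sketch does not attempt to explain the difference.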

The PHODIR project has pioneered a fully photonics-based radar 
system, enabling the software-defined radio approach with a perfor- 
mance exceeding that of state-of-the-art radar systems. In particular, 
today’s electronic transceivers cannot achieve the same frequency range 
without the use of several parallel architectures, and do not provide an 
equivalent precision, especially at high carrier frequencies. A field-trial 
demonstrator has proved the effectiveness and the expected precision 
of the photonic architecture. This proof-of-concept system has been 
realized using commercial devices, except for the switching matrix which 
was developed within the EU programme NEXPRESSO. We are now
planning to develop dedicated photonic integrated circuits, which could 
fully realize the potential of the photonic approach, leading to compact, 
flexible systems with high performance for the most advanced applications,
such as simultaneous surveillance and communications. The use
of photonics in radar systems also allows additional functionalities, which
make it even more convenient. For example, a tunable time delay can
be applied to the lasers transporting the RF signals to control the
beamforming at arrayed antennas. Moreover, photonics permits the use of
optical fibres for transporting the radar signals to and from the antenna,
allowing the antenna to be placed remotely with negligible losses. Therefore,
the architecture we propose, as sketched in Fig. 2, should open up new frontiers in
the field of radar, enabling future smart multifunction surveillance systems.


METHODS SUMMARY 


We use the OneFive Origami passive MLL with a pulse repetition frequency of 
400 MHz and an optical bandwidth of about 8 nm at a wavelength of 1,560 nm. In 
photonics-based signal generation, the frequency shift of one of the modes of the 
MLL can be realized, for example, by single-sideband modulation with carrier sup- 
pression. The state-of-the-art synthesizer used as a reference in the analysis of the 
phase noise of the photonics-generated signal is the Agilent E8257D. The
integrated 1 × 4 LiNbO₃ switching matrix employed at the receiver stage consists of
three dual-output MZMs in a cascaded configuration. The chromatic dispersion of
the spools of dispersion compensating fibre before the photodiodes is −300 ps nm⁻¹.
The electronic ADCs are the data acquisition boards NI PXIe5122 from National
Instruments, with 14-bit resolution and 100 × 10⁶ samples per second. The digital
processing implemented to interleave the samples also suppresses the spurious 
tones caused by the nonlinearity of the optical modulator and the different atten- 
uations and delays of the four parallel branches of the photonics-based receiver. 
The synthesizer used for testing the photonic ADC is the Agilent E8257D. In the 
RF front-end of the PHODIR demonstrator, the RF filter at the receiver has a 
bandwidth of 40 MHz. Therefore both the received echo and the reference signal 
are optically sampled at 400 × 10⁶ samples per second, and digitized by acquiring
their aliased replica with ADCs at 80 × 10⁶ samples per second. The non-modulated
1-μs-long radar pulses give a range resolution of 150 m, with a velocity resolution
of 2 km h⁻¹ given by the coherent integration time of 20 ms. In the field trial, we
ran a preliminary system calibration focusing on static targets, detecting objects in
a range up to 30 km with a transmitted RF peak power of about 20 W.
Sulphide oxidation and carbonate dissolution as a
source of CO₂ over geological timescales


Mark A. Torres, A. Joshua West & Gaojun Li


The observed stability of Earth’s climate over millions of years is
thought to depend on the rate of carbon dioxide (CO₂) release from
the solid Earth being balanced by the rate of CO₂ consumption by
silicate weathering. During the Cenozoic era, spanning approximately
the past 66 million years, the concurrent increases in the
marine isotopic ratios of strontium, osmium and lithium suggest
that extensive uplift of mountain ranges may have stimulated
CO₂ consumption by silicate weathering, but reconstructions of
sea-floor spreading do not indicate a corresponding increase in
CO₂ inputs from volcanic degassing. The resulting imbalance would
have depleted the atmosphere of all CO₂ within a few million years.
As a result, reconciling Cenozoic isotopic records with the need for
mass balance in the long-term carbon cycle has been a major and
unresolved challenge in geochemistry and Earth history. Here we
show that enhanced sulphide oxidation coupled to carbonate
dissolution can provide a transient source of CO₂ to Earth’s atmosphere
that is relevant over geological timescales. Like drawdown by means
of silicate weathering, this source is probably enhanced by tectonic
uplift, and so may have contributed to the relative stability of the
partial pressure of atmospheric CO₂ during the Cenozoic. A variety
of other hypotheses have been put forward to explain the ‘Cenozoic
isotope-weathering paradox’, and the evolution of the carbon cycle
probably depended on multiple processes. However, an important
role for sulphide oxidation coupled to carbonate dissolution is
consistent with records of radiogenic isotopes, atmospheric CO₂ partial
pressure and the evolution of the Cenozoic sulphur cycle,
and could be accounted for by geologically reasonable changes in
the global dioxygen cycle, suggesting that this CO₂ source should be
considered a potentially important but as yet generally unrecognized
component of the long-term carbon cycle.

It has been proposed that excess CO₂ uptake from enhanced silicate
weathering over the Cenozoic could be balanced by a net decrease in
the size of the organic C reservoir. However, recent work has shown
that orogenic activity generally increases organic C burial and stabilizes
petrogenic organic C, through graphitization. Model results also
suggest that global changes in organic matter burial during the Cenozoic
were insufficient to balance excess CO₂ consumption (see, for example,
refs 10, 14).

The release of CO₂ from the thermal decomposition of carbonate
minerals during metamorphism has been suggested to be the missing
source of CO₂ required to balance the Cenozoic C cycle. Petrologic
and geochronologic data on the timing and potential magnitude of
associated fluxes call this mechanism into question. Additionally, any
estimate of CO₂ release during metamorphism is subject to
considerable uncertainty because it is not known what fraction of the CO₂
makes it into the atmosphere after passing through kilometres of
variably permeable and reactive rock.

Recent modelling work has suggested that the acceleration of
continental weathering by Cenozoic uplift was compensated by a decrease
by a factor of ~2 in the weathering of ocean island basalts in response
to changing climatic conditions. This is not implausible but requires
very large changes, and there is currently no direct evidence to test this
prediction.

Another possibility that has not yet been systematically explored is
that Cenozoic uplift, in addition to increasing rates of CO₂ drawdown
by silicate weathering, increased rates of sulphide oxidation coupled to
carbonate dissolution, which provided a transient source of CO₂. The
oxidation of sulphides (for example, pyrite) produces sulphuric acid
following the equation

$$4\,\mathrm{FeS_2} + 15\,\mathrm{O_2} + 14\,\mathrm{H_2O} \rightarrow 4\,\mathrm{Fe(OH)_3} + 8\,\mathrm{H_2SO_4} \tag{1}$$

The dissolution of carbonate minerals with this sulphuric acid results
in the transfer of sedimentary C into the ocean-atmosphere system as
CO₂, which can be represented by the equations

$$\mathrm{CaCO_3} + \mathrm{H_2SO_4} \rightarrow \mathrm{CO_2(g)} + \mathrm{H_2O} + \mathrm{Ca^{2+}} + \mathrm{SO_4^{2-}} \tag{2}$$

and

$$2\,\mathrm{CaCO_3} + \mathrm{H_2SO_4} \rightarrow 2\,\mathrm{Ca^{2+}} + \mathrm{SO_4^{2-}} + 2\,\mathrm{HCO_3^-} \tag{3}$$

For equation (2), the release of CO₂ occurs immediately. For equation
(3), CO₂ is still released but on the timescale of carbonate precipitation
in the ocean (10⁶ yr), represented by the equation

$$\mathrm{Ca^{2+}} + 2\,\mathrm{HCO_3^-} \rightarrow \mathrm{CaCO_3} + \mathrm{CO_2(g)} + \mathrm{H_2O} \tag{4}$$


A strong correlation between erosion and sulphide oxidation rates
suggests that orogenic activity, in addition to increasing CO₂
drawdown through enhanced silicate weathering, will also increase CO₂
release from coupled sulphide oxidation and carbonate dissolution.
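
The stoichiometry of equations (1)-(4) can be tallied explicitly. The sketch below is a bookkeeping illustration, not part of the calculations described in Methods; the names are hypothetical, and it assumes that all of the sulphuric acid from equation (1) reacts with carbonate, with a chosen fraction following equation (2) and the remainder following equation (3) and then equation (4).

```python
from fractions import Fraction

# Equation (1): 4 FeS2 yield 8 H2SO4, so 2 mol of acid per mol of pyrite.
H2SO4_PER_FES2 = Fraction(8, 4)

def co2_per_mol_pyrite(fraction_via_eq2: Fraction):
    """Return (prompt, delayed) mol CO2 released per mol FeS2 oxidized.

    Equation (2): 1 CO2 per H2SO4, released immediately.
    Equation (3): 2 HCO3- per H2SO4, which equation (4) later converts to
    1 CO2 on the timescale of carbonate precipitation.
    """
    prompt = H2SO4_PER_FES2 * fraction_via_eq2
    delayed = H2SO4_PER_FES2 * (1 - fraction_via_eq2)
    return prompt, delayed

for share in (Fraction(0), Fraction(1, 2), Fraction(1)):
    prompt, delayed = co2_per_mol_pyrite(share)
    print(f"eq. (2) share {share}: prompt {prompt}, delayed {delayed}, "
          f"total {prompt + delayed} mol CO2 per mol FeS2")
```

Under this assumption the total is 2 mol of CO₂ per mol of pyrite (1 mol per mol of sulphide S) whichever pathway dominates; only the timing of the release differs.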

Estimates of the modern sulphide oxidation fluxes range from 0.64
to 1.26 Tmol yr⁻¹ and are based on either the chemistry of large rivers
or the modelled composition of weathering sources. Direct
measurements of sulphide-derived S fluxes have been limited, because isotopic
measurements are generally required to distinguish reliably between
sulphate and sulphide-derived S. Data from the localities where the
relative proportions of sulphide and sulphate-derived S have been
constrained indicate a flux of ~0.2 Tmol yr⁻¹ of sulphide S,
contributed by ~2% of global land area. Directly extrapolating this value
globally would yield a flux of 10 Tmol yr⁻¹ sulphide S. Although the
areas with well-constrained data may represent environments with
disproportionately high sulphide oxidation rates, it is nonetheless likely
that the actual global sulphide oxidation rate has been previously
underestimated, or at least lies in the upper range of previous estimates.
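
The extrapolation above is a simple area scaling, which can be written out as follows. The function name is hypothetical, and the calculation assumes, as the text cautions may not hold, that the well-studied localities are representative of global land area.

```python
def extrapolate_flux_tmol_yr(measured_flux_tmol_yr: float, area_fraction: float) -> float:
    """Scale a flux measured over a fraction of land area to the whole land area."""
    return measured_flux_tmol_yr / area_fraction

global_flux = extrapolate_flux_tmol_yr(0.2, 0.02)  # ~0.2 Tmol/yr from ~2% of land area
previous_low, previous_high = 0.64, 1.26            # earlier estimates quoted in the text

print(f"extrapolated flux: {global_flux:.1f} Tmol/yr")
print(f"ratio to earlier estimates: {global_flux / previous_high:.1f}x "
      f"to {global_flux / previous_low:.1f}x")
```

The naive scaling exceeds the earlier estimates by roughly an order of magnitude, which is why the text treats it as an indication rather than a best estimate.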

In the localities where both silicate weathering and sulphide oxidation
rates have been measured, it is possible to compare the rates of CO₂
uptake and release (Fig. 1 and Methods). For the Mackenzie River in
Canada and the Liwu River in Taiwan, the calculated release
rates of CO₂ associated with sulphide oxidation are greater than the
uptake rates of CO₂ by silicate weathering, suggesting that these
environments release CO₂ into the atmosphere. For the Ganges-Brahmaputra
river system in India, the rates of CO₂ consumption and release are
comparable, which suggests that all of the CO₂ consumption by silicate
weathering is, to first order, balanced by coupled sulphide oxidation
and carbonate dissolution.

Ultimately, the CO₂ released by coupled sulphide oxidation and
carbonate dissolution is balanced by alkalinity production during
sulphate reduction and sulphide precipitation in marine sediments, which
can be represented by the generalized equation

$$2\,\mathrm{FeOOH} + 2\,\mathrm{H^+} + \mathrm{Ca^{2+}} + 2\,\mathrm{SO_4^{2-}} + 4\,\mathrm{CH_2O} \rightarrow \mathrm{FeS_2} + \mathrm{Fe^{2+}} + 4\,\mathrm{H_2O} + \mathrm{Ca^{2+}} + 4\,\mathrm{HCO_3^-} \tag{5}$$

The long residence time of sulphate (>10 Myr) relative to Ca²⁺ (1 Myr)
in the ocean allows for the transient release of sedimentary C into the
atmosphere as a result of coupled sulphide oxidation and carbonate
dissolution.

The controls on the duration of transient CO₂ release can be explored
using a conceptual box model of the ocean (Fig. 2 and Methods). In this
model, the input of carbon from carbonate weathering by sulphuric
acid (equations (1)-(3)) provides an excess sedimentary contribution
to the combined ocean-atmosphere pool, represented in our model by
dissolved inorganic carbon, or DIC. The excess sedimentary
DIC is ultimately balanced by DIC derived from the oxidation
of organic carbon during sulphate reduction (equation (5)), such that the
ratio of sedimentary DIC to total DIC derived from carbonate
weathering is equal to 0.5 in steady state. For an instantaneous doubling of
weathering-derived input fluxes, an increase in the ratio of sedimentary
DIC to total DIC is observed for a period of ~70 Myr, reflecting a
transient release of CO₂ (Fig. 2). For a doubling of input fluxes that
occurs over a period of 40 Myr, the transient release of CO₂ lasts for
~95 Myr (Fig. 2). These calculated timescales are comparable to the
duration of enhanced silicate weathering implied by isotopic records,
suggesting that enhanced sulphide oxidation is potentially important
in countering the excess CO₂ drawdown caused by tectonic uplift during
the Cenozoic.
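
Why the slower sulphate reservoir sets the timescale can be seen with a deliberately simplified calculation. The sketch below is not the box model of Fig. 2 and does not reproduce its ~70 Myr and ~95 Myr results; it assumes a single well-mixed reservoir with first-order removal (output equal to inventory divided by residence time), hypothetical function names, and residence times of 1 Myr and 10 Myr taken as round illustrative values. After a step increase in input, the part of the extra input not yet matched by output decays as exp(−t/τ).

```python
import math

def unmatched_fraction(t_myr: float, residence_time_myr: float) -> float:
    """Fraction of a step increase in input not yet balanced by output at time t."""
    return math.exp(-t_myr / residence_time_myr)

def time_to_rebalance_myr(residence_time_myr: float, tolerance: float = 0.05) -> float:
    """Time for the unmatched fraction to fall below the given tolerance."""
    return residence_time_myr * math.log(1.0 / tolerance)

for name, tau in (("Ca2+ (assumed 1 Myr)", 1.0), ("sulphate (assumed 10 Myr)", 10.0)):
    print(f"{name}: output within 5% of input after "
          f"{time_to_rebalance_myr(tau):.1f} Myr")
```

In this toy picture the calcium budget rebalances within a few million years while the sulphate budget takes tens of millions of years, so carbonate-derived carbon can enter the ocean-atmosphere system well before the compensating sulphate reduction catches up.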

Figure 1 | CO₂ uptake (green bars) and release (red bars) rates calculated for
the Mackenzie River, the Liwu River and the Ganges-Brahmaputra
(G-B). The compiled data are presented in Extended Data Table 1, and a
description of the calculation method is presented in Methods. Error bars
reflect the propagation of the analytical uncertainty of each value that was
reported by the original authors (1 s.d.). For the Ganges-Brahmaputra, the
uncertainty in the proportion of the sulphate flux derived from sulphide
oxidation was not reported and, therefore, no error bar is included.

Figure 2 | Conceptual model of the timescale of CO₂ release associated with
the contrasting timescales of sulphide and carbonate burial. CO₂ release (red
portion of the graph) occurs when the ratio of DIC derived from sedimentary
carbonates (DIC_sedimentary) to total DIC (DIC_total) is above 0.5, which is the
value associated with the dissolution of carbonates by carbonic acid and is
pCO₂ neutral (that is, the precipitation of carbonates from a fluid with this
composition would have no effect on long-term pCO₂). a, The timescale of CO₂
release (solid green line) associated with an instantaneous doubling of the input
fluxes (dashed blue line) from carbonate weathering by carbonic acid and
sulphuric acid. b, The timescale of CO₂ release (solid green line) associated with
a doubling of the input fluxes (dashed blue line) from carbonate weathering by
carbonic acid and sulphuric acid that occurs progressively over 40 Myr.

Sulphide minerals are concentrated in organic-rich sedimentary rocks
along with chalcophile elements such as Re and Os (ref. 24). The decay
of ¹⁸⁷Re to ¹⁸⁷Os leads to elevated ¹⁸⁷Os/¹⁸⁸Os ratios in sulphide- and
organic-rich sedimentary rocks, making the marine record of ¹⁸⁷Os/¹⁸⁸Os
sensitive to the weathering of these lithologies over geologic time.
Using the modelling framework of ref. 10, which reconstructs changes
in sedimentary rock weathering based in part on an ¹⁸⁷Os/¹⁸⁸Os mass
balance, we calculate the CO₂ release as a result of coupled sulphide
oxidation and carbonate dissolution during the Cenozoic (Methods).
The effects of pyrite burial are not included in the model because there
are few constraints on how burial rates may have varied over the Cenozoic.

Using the model output of CO₂ uptake from continental silicate weathering
and release related to sulphide oxidation, we calculate the change
in the net CO₂ drawdown from mountain uplift throughout the Cenozoic
(Fig. 3). The reconstructed net CO₂ drawdown from mountain
uplift is relatively constant from 50-25 Myr ago and consistent with
proxy-based reconstructions of the atmospheric CO₂ partial pressure,
pCO₂ (Fig. 3). From 25-15 Myr ago, there is a period of marked increase
in the net CO₂ consumption from mountain uplift that corresponds
with a decline in pCO₂ (Fig. 3; refs 11, 12). This increase results from a
mismatch in the shape of the ⁸⁷Sr/⁸⁶Sr and ¹⁸⁷Os/¹⁸⁸Os curves such that
silicate weathering continues to increase while the weathering of sulphide-
and organic-rich sedimentary rocks remains constant (Fig. 3). After this
perturbation, the net CO₂ drawdown from mountain uplift and the pCO₂ of
the atmosphere stabilize at higher and lower values, respectively,
relative to 50-25 Myr ago (Fig. 3; refs 11, 12). Lower pCO₂ conditions
from 15-0 Myr ago would decrease weathering fluxes through the
climate-weathering feedback and act to balance the larger net CO₂ drawdown
from mountain uplift over this time (see, for example, ref. 10), which is
consistent with ¹⁰Be/⁹Be evidence for constant global weathering fluxes
over the past 12 Myr (ref. 25). The general consistency between the
proxy-based pCO₂ record and the model calculations supports the hypothesis
that sulphide oxidation had an important role in the Cenozoic C cycle
and may help, at least in part, to reconcile radiogenic isotope records
Figure 3 | Comparison of the isotope mass balance model calculations with
proxy records for changes in the Cenozoic C cycle. Values of silicate
weathering, sedimentary rock weathering, and the net drawdown of CO₂ by uplift
are all relative to modern fluxes with the width of each of the curves reflecting
the propagated uncertainty (1 s.d.) associated with the model input parameters.
The shaded period highlights a major increase in the net weathering CO₂
sink that is associated with a drop in atmospheric pCO₂ (refs 11, 12). Errors
in pCO₂ reflect the range of pCO₂ values for different proxy calibrations (ref. 11
(light blue) and ref. 12 (dark blue)). 


with atmospheric pCO₂ proxies and the requirement of mass balance in
the long-term C cycle. 

Enhanced sulphide oxidation in response to Cenozoic uplift should
also affect the long-term S cycle. Fluid inclusion analyses²⁶ and the
isotopic composition of authigenic barites²⁷ suggest that marine sulphate
concentrations increased during the Cenozoic without a corresponding
change in the sulphur isotope ratio. This complicates straightforward
interpretation of the Cenozoic S cycle because processes that could
modulate marine sulphate concentrations should also affect sulphur
isotope ratios. By using an inverse model of the S cycle (Methods), we find
that our reconstruction of sedimentary rock weathering can be reconciled
with the marine δ³⁴S_sulphate (δ³⁴S = ((³⁴S/³²S)_sample/(³⁴S/³²S)_VCDT − 1)
× 1,000; VCDT, Vienna-Canyon Diablo Troilite) record for a reasonable
range of parameter values (Extended Data Fig. 1), and that the
corresponding range of CO₂ release is of similar magnitude to what
would be required to balance proposed increases in silicate weathering
(Extended Data Fig. 4). Thus, our hypothesis is consistent with the marine
δ³⁴S_sulphate record and can simultaneously explain the observed increase
in sulphate concentrations through the Cenozoic.
Figure 4 | Trade-off between CO₂ release and O₂ consumption as a result of
sulphide oxidation. The dashed horizontal lines represent the CO₂ release
required to fully compensate for a 10% (blue), 25% (purple) or 50% (orange)
increase in continental silicate weathering that occurred linearly over the past
15 Myr, assuming a modern flux of 3.5 × 10¹⁸ mol Myr⁻¹ (Methods). The 25%
increase corresponds to the increase predicted by the mass balance model of
ref. 10 and implies pO₂ at 15 Myr ago of ~25%. The estimate for the maximum
possible atmosphere O₂ content of 25% (ref. 28) is at odds with modelling and
experimental evidence for higher maximum O₂ contents. The solid line shows
the ratio of CO₂ release to O₂ consumption for sulphide oxidation.


The O₂ content of the atmosphere depends on the balance between
the burial and oxidation of reduced carbon and sulphur compounds.
An increase in sulphide oxidation will consume O₂, with a molar ratio
of O₂ consumption to CO₂ production of 15/8 (equations (1)-(3) and
Fig. 4). The hypothesis that CO₂ release as a result of sulphide oxidation
has been important in balancing enhanced CO₂ consumption
since the Miocene epoch (as suggested by Fig. 3) would require O₂
consumption during this period. Completely compensating for a 25%
increase in silicate-weathering CO₂ drawdown over the past 15 Myr
(the increase implied in the model of ref. 10 and shown in Fig. 3) would
mean an early- to mid-Miocene O₂ partial pressure of pO₂ ≈ 25%,
compared with the present-day value of 21% (Fig. 4). There is little
independent evidence for how much atmospheric pO₂ varied during the
Cenozoic era, but changes of the order of 4% are within geologically
reasonable ranges and are consistent with estimates derived from
the S cycle modelling (Methods and Extended Data Fig. 4). If O₂
consumption occurred over the whole Cenozoic, as some of our S isotope
models suggest (Methods and Extended Data Fig. 1), the additional O₂
demand could require Palaeocene pO₂ values of the order of 23-33%
(Methods). Importantly, these calculations do not fully capture actual
expected changes in pO₂, because the oxygen budget also depends on fluxes
in the organic C cycle and the average oxidation state of this carbon,
which is variable and poorly constrained. The key point is that the
trade-off between O₂ and CO₂ places a practical limit on geologically
reasonable CO₂ production by sulphide oxidation, but the relative
magnitudes of the reservoirs and fluxes mean that sulphide oxidation can
have an important role in balancing silicate-weathering-driven CO₂
consumption if accompanied by modest variation in atmospheric pO₂.
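
The arithmetic behind this trade-off can be sketched in a few lines of Python. This is only an illustrative calculation, not the authors' code: the 15/8 ratio and the 21% present-day O₂ content come from the text, whereas the modern silicate-weathering flux (3.5 × 10¹⁸ mol Myr⁻¹, as read from the Fig. 4 caption), the total number of moles of gas in the atmosphere, and the reading of a "25% increase" as relative to the flux 15 Myr ago are assumptions made for the example.

```python
# 15 mol O2 are consumed for every 8 mol CO2 released (ratio from the text).
O2_PER_CO2 = 15.0 / 8.0

# Assumed values for illustration only.
MODERN_SILICATE_FLUX = 3.5e18   # mol CO2 per Myr (assumed modern drawdown)
ATMOSPHERE_MOLES = 1.77e20      # mol of gas in the modern atmosphere (assumed)
MODERN_O2_FRACTION = 0.21       # present-day O2 mole fraction


def past_o2_fraction(increase, duration_myr=15.0):
    """O2 mole fraction `duration_myr` ago if sulphide oxidation fully
    compensated a linear rise in silicate-weathering CO2 drawdown.

    `increase` is the fractional rise relative to the starting flux, so the
    modern flux equals (1 + increase) times the starting flux (an assumption).
    """
    start_flux = MODERN_SILICATE_FLUX / (1.0 + increase)
    # Area of the triangle between the rising flux and its starting value.
    extra_co2 = 0.5 * (MODERN_SILICATE_FLUX - start_flux) * duration_myr
    o2_consumed = O2_PER_CO2 * extra_co2
    modern_o2 = MODERN_O2_FRACTION * ATMOSPHERE_MOLES
    other_gases = ATMOSPHERE_MOLES - modern_o2
    # Ideal-gas treatment: mole fraction equals volume fraction.
    past_o2 = modern_o2 + o2_consumed
    return past_o2 / (past_o2 + other_gases)


if __name__ == "__main__":
    for rise in (0.10, 0.25, 0.50):
        print(f"{rise:.0%} rise -> pO2 15 Myr ago = {past_o2_fraction(rise):.1%}")
```

Under these assumptions a 25% rise gives a value close to the ~25% quoted above; other choices for the atmospheric inventory or the definition of the increase would shift the result slightly.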

It is unlikely that the evolution of the C cycle during the Cenozoic
depended solely on changes in silicate weathering balanced by
sulphide-carbonate weathering; additional CO₂ release from metamorphic
decarbonation, decreasing rates of ocean-island-basalt weathering associated
with global climate-weathering feedbacks, and changes in the organic
C cycle may also have been important. Although many questions remain,
including about the coincident variation in pO₂, sulphide oxidation coupled
to carbonate dissolution does provide an attractive and thus far
under-recognized source of CO₂ because it is driven by the same forcing
mechanism as enhanced CO₂ consumption by silicate weathering, operates
on a sufficiently long timescale (Fig. 2), and is consistent with proxy
records of weathering fluxes and atmospheric pCO₂ variations during
the Cenozoic (Fig. 3; refs 11, 12).


METHODS SUMMARY 


All data for calculating the river fluxes shown in Fig. 1 have been compiled from 
published literature. The data were corrected for atmospheric inputs and partitioned 
between lithologic and acid sources using standard methods based on endmember 
elemental and isotopic ratios. 

The conceptual box model presented in Fig. 2 was constructed using a set of 
ordinary differential equations describing the concentrations of DIC derived from 
sedimentary rocks, DIC derived from atmospheric CO₂, and sulphate in the ocean
considering only the fluxes related to the weathering or precipitation of carbonates 
and sulphides. 

The model¹⁰ used to produce the results shown in Fig. 3 is based on mass balance
equations that relate fluxes in the long-term C cycle to isotopic records (δ¹³C,
⁸⁷Sr/⁸⁶Sr and ¹⁸⁷Os/¹⁸⁸Os), dissolved Mg concentrations and sea-floor spreading
rates. Changes in the fluxes relative to modern values are calculated by solving the
equations simultaneously using smoothed fits of the proxy data.

For the inverse model of the S cycle, we adopted the approach of ref. 31. Model 
inputs include the results of the model of ref. 10 (Fig. 3) for the time evolution of 
the riverine sulphate flux, the marine δ³⁴S_sulphate record of ref. 27 and marine
sulphate concentration data from ref. 26. The equation was arranged to solve for a
single S cycle parameter while holding the other parameters at constant values 
reflecting their predicted ranges. 

The ratio of O₂ consumption to CO₂ production implied by our model was
calculated with equations (1)-(3) and used to assess the Cenozoic O₂ budget for
scenarios of enhanced silicate weathering over the past 15 Myr that bracket the
range of predicted values. Values of O₂ consumption were converted to the pO₂
required 15 Myr ago by modelling the atmosphere as an ideal gas (that is, equating
the mole fraction with the volume fraction).


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS


Calculation of CO₂ fluxes from modern rivers. All data for calculating the river
fluxes shown in Fig. 1 have been compiled from published literature. CO₂ uptake
rates are calculated from fluxes of Na, K, Ca and Mg corrected for rainwater input
using Cl concentrations and rainwater cation/Cl ratios. To partition the Ca flux
between carbonate and silicate components, an endmember silicate Ca/Na ratio of
0.35 was used for the Liwu and Mackenzie and a value of 0.2 was used for
the Ganges-Brahmaputra (G-B). To partition the Mg flux between silicate and
carbonate sources, an endmember silicate Mg/Na ratio of 0.24 was used for the
Liwu and Mackenzie whereas a Mg/K ratio of 0.5 was used for the G-B,
following the methods of the original authors. All of the Na and K fluxes after
correction for rain contributions are assumed to be derived from silicate sources
except in the Mackenzie, where a portion of Na flux is contributed by evaporites
and was accounted for using ⁸⁷Sr/⁸⁶Sr ratios assuming an evaporite endmember
ratio of 0.7081 (ref. 22). All of the silicate Ca and Mg fluxes were assumed to
contribute to long-term CO₂ drawdown whereas only 30% of the Na flux and 20%
of the K flux were considered. For the G-B calculations, some of our values differ
slightly from those reported in ref. 19, but this does not affect the overall
interpretation. The calculated values are presented in Extended Data Table 1.
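
The partitioning described above can be illustrated with a minimal Python sketch. It is not the authors' code: the function names and the example inputs are hypothetical, the default endmember ratios are those quoted for the Liwu and Mackenzie, and the result is the weighted silicate cation flux only (the stoichiometric conversion of that flux to mol of CO₂ is not specified in this section and is therefore left out).

```python
def rain_corrected(flux, cl_flux, rain_ratio):
    """Subtract the rainwater contribution, using Cl as the rain tracer.

    rain_ratio is the element/Cl ratio of rainwater (hypothetical input).
    """
    return flux - cl_flux * rain_ratio


def silicate_cation_budget(na_sil, k_sil, ca_na_sil=0.35, mg_na_sil=0.24,
                           na_weight=0.30, k_weight=0.20):
    """Partition Ca and Mg into silicate components from silicate Na.

    Returns silicate Ca, silicate Mg and the weighted sum that counts all
    silicate Ca and Mg but only 30% of Na and 20% of K, as in the text.
    """
    ca_sil = ca_na_sil * na_sil
    mg_sil = mg_na_sil * na_sil
    counted = ca_sil + mg_sil + na_weight * na_sil + k_weight * k_sil
    return {"Ca_sil": ca_sil, "Mg_sil": mg_sil, "counted": counted}


if __name__ == "__main__":
    # Hypothetical fluxes in arbitrary but consistent units.
    na = rain_corrected(flux=120.0, cl_flux=25.0, rain_ratio=0.8)
    k = rain_corrected(flux=12.0, cl_flux=25.0, rain_ratio=0.08)
    print(silicate_cation_budget(na, k))
```

For the G-B, where a Mg/K ratio is used instead of Mg/Na, the Mg line would need to be changed accordingly.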

To calculate CO₂ release rates, the proportion of the sulphate flux derived from
sulphide oxidation was estimated using δ³⁴S_sulphate (Liwu), δ³⁴S_sulphate and δ¹⁸O_sulphate
(Mackenzie) or δ¹³C_DIC (G-B). All of the sulphide-derived SO₄²⁻ flux was
assumed to react preferentially with carbonates, which provides a maximum
estimate of CO₂ release rates. The relative difference between release and uptake
rates (that is, the comparison shown in Fig. 1) does not change depending on the
partitioning of the sulphide-derived SO₄²⁻ between silicate and carbonate rocks.
For the Liwu calculation, the δ³⁴S_sulphate measurements used to calculate the
proportion of sulphide-derived SO₄²⁻ are from the adjacent Kaoping River because
there are no published values from the Liwu. Annual fluxes from the Liwu are
considered because they are better constrained, and the two rivers drain similar
lithologies.

Conceptual box model. The conceptual box model (Fig. 2) was constructed using
a set of ordinary differential equations describing the concentrations of DIC and
sulphate in the ocean considering only the fluxes related to the weathering or
precipitation of carbonates and sulphides (equations (1)-(5)). Initial values of
8 × 10¹⁸ mol Myr⁻¹, 5 × 10¹⁸ mol Myr⁻¹ and 1 × 10¹⁸ mol Myr⁻¹ are used for
the input fluxes of total DIC, sedimentary DIC and sulphide-derived sulphate,
respectively. The output fluxes (carbonate precipitation and sulphate reduction or
sulphide precipitation) are assumed to vary linearly as functions of concentration
with rate constants of 1 Myr⁻¹ for DIC and 0.1 Myr⁻¹ for sulphate. The model
was run for 120 Myr to reach a steady state before the input fluxes were doubled
instantaneously (Fig. 2a) or over the course of 40 Myr (Fig. 2b).
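
The contrast in response times that drives this model can be seen with a generic first-order reservoir, dM/dt = F_in − kM. The Python sketch below is not a reimplementation of equations (1)-(5), which are not reproduced in this section; it only shows, under that simplifying assumption, how much faster a reservoir with k = 1 Myr⁻¹ (DIC) approaches its new steady state than one with k = 0.1 Myr⁻¹ (sulphate) after the input is doubled. The function names are illustrative.

```python
import math


def relax(mass, flux_in, k, dt):
    """Advance dM/dt = flux_in - k*M by dt using the exact solution."""
    steady = flux_in / k
    return steady + (mass - steady) * math.exp(-k * dt)


def response_fraction(k, t):
    """Fraction of the adjustment to the new steady state completed t Myr
    after an instantaneous doubling of the input flux."""
    flux = 1.0                # arbitrary units; only ratios matter here
    mass = flux / k           # start from the old steady state
    new_steady = 2.0 * flux / k
    mass_t = relax(mass, 2.0 * flux, k, t)
    return (mass_t - mass) / (new_steady - mass)


if __name__ == "__main__":
    for name, k in (("DIC", 1.0), ("sulphate", 0.1)):
        print(name, round(response_fraction(k, 5.0), 3))
```

Five million years after the doubling, the fast reservoir has essentially re-equilibrated whereas the slow one has completed well under half of its adjustment, which is the kind of lag that allows a transient imbalance to persist.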

Li and Elderfield mass balance model. The results from the adapted Li and
Elderfield¹⁰ model shown in Fig. 3 are based on a series of isotope mass balance
equations that relate the weathering of continental silicates, the weathering of
sedimentary rocks (which includes organic C (C_org) and carbonate weathering),
the weathering of basaltic rocks, reverse weathering or hydrothermal alteration,
C_org burial, carbonate burial, and volcanic CO₂ degassing to the marine records of
⁸⁷Sr/⁸⁶Sr (ref. 2), ¹⁸⁷Os/¹⁸⁸Os (ref. 3), δ¹³C_org (ref. 33), δ¹³C_carbonate (ref. 33),
dissolved Mg concentrations and sea-floor spreading rates. The mass balance
equations are solved simultaneously using smoothed fits of the published marine
proxy records to calculate changes in the fluxes of the different C cycle parameters
relative to modern values (the modern values used in the calculation are presented
in ref. 10).

To simplify the model, various assumptions are made regarding the mass
balances of each element or isotope system. For the Os system, the isotopic
composition of the ocean is considered to be controlled by the weighted average of inputs
from continental silicate rocks, island basalts, sedimentary rocks, hydrothermal
activity and cosmic dust as described by the equation

$$\sum_i \left[ K_i\,\mathrm{Os}_i \left( R_i^{\mathrm{Os}} - R_{\mathrm{ocean}}^{\mathrm{Os}} \right) \right] = 0 \qquad (6)$$

where K_i is the intensity of the ith flux relative to the late Pleistocene value, Os_i is
the late Pleistocene value of the ith flux, R_i^Os is the normalized osmium isotopic
composition of the ith flux and R_ocean^Os is the normalized osmium isotopic
composition of the ocean. This is a simplification of the Os system because different
sedimentary lithologies contribute different amounts of Os with distinct ¹⁸⁷Os/¹⁸⁸Os
values. For example, studies of modern river systems have suggested that carbonates
contribute non-radiogenic Os whereas organic-rich shales contribute more
radiogenic Os. This is taken into account in our model by assigning a mean ¹⁸⁷Os/¹⁸⁸Os
value for sedimentary rocks that reflects a weighted contribution from different
sedimentary lithologies. Because this value is held constant over the Cenozoic,
we assume that weathering fluxes from different sedimentary lithologies change in
a constant proportion. This is a reasonable assumption because different
sedimentary lithologies are interbedded on a large scale, and thus exhumed in tandem. If this
proportion were to vary, one would predict that Os fluxes from carbonates would
increase relative to organic-rich shales because carbonate minerals are more
abundant. This would result in an underestimation of changes in sedimentary rock
weathering, making our calculation of CO₂ release from sulphide oxidation a
minimum estimate, thereby strengthening our argument about the potential
importance of this mechanism. The Os system is further simplified by assuming a constant
cosmic input flux.
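
Equation (6) states that the ocean composition is the flux-weighted mean of the inputs, which a short Python sketch makes explicit. The function name and all numerical values below are hypothetical and chosen only to show the behaviour; they are not the parameter values of ref. 10.

```python
def ocean_ratio(fluxes):
    """Solve sum_i K_i * Os_i * (R_i - R_ocean) = 0 for R_ocean.

    fluxes is a list of (K_i, Os_i, R_i) tuples: relative intensity,
    late Pleistocene flux and isotopic composition of each input.
    """
    weight = sum(k * os_i for k, os_i, _ in fluxes)
    weighted = sum(k * os_i * r for k, os_i, r in fluxes)
    return weighted / weight


if __name__ == "__main__":
    # Hypothetical two-source example: one radiogenic and one
    # non-radiogenic input with equal late Pleistocene fluxes.
    baseline = [(1.0, 1.0, 1.0), (1.0, 1.0, 0.0)]
    enhanced = [(2.0, 1.0, 1.0), (1.0, 1.0, 0.0)]  # radiogenic input doubled
    print(ocean_ratio(baseline), ocean_ratio(enhanced))
```

Raising the relative intensity K_i of the radiogenic source pulls the ocean value towards that source, which is why the marine record is sensitive to changes in the weathering of radiogenic lithologies.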

Owing to the long residence time of Sr relative to the modelling timescale, the
transient-state solution to the mass balance equation is used, and expressed as

$$N\,\frac{dR_{\mathrm{ocean}}^{\mathrm{Sr}}}{dt} = \sum_i \left[ K_i\,\mathrm{Sr}_i \left( R_i^{\mathrm{Sr}} - R_{\mathrm{ocean}}^{\mathrm{Sr}} \right) \right] \qquad (7)$$

where N is the amount of Sr in seawater, dR_ocean^Sr/dt is the time derivative of the
normalized Sr isotopic composition of the ocean, Sr_i is the late Pleistocene value of the ith
flux, R_i^Sr is the normalized Sr isotopic composition of the ith flux and R_ocean^Sr is the
normalized Sr isotopic composition of the ocean. In this mass balance,
contributions from continental silicate rocks, island basalts, sedimentary rocks (that is,
carbonates), hydrothermal activity and sediment diagenesis are considered. Like
the Os isotope system, the reconstruction of weathering fluxes using the marine
⁸⁷Sr/⁸⁶Sr record is sensitive to the isotopic composition of source rocks. Although
carbonate minerals typically have low ⁸⁷Sr/⁸⁶Sr values relative to igneous rocks,
intense metamorphic activity, such as that experienced in the Himalayan region,
can impart unusually radiogenic Sr isotope compositions to carbonate minerals.
The potential for radiogenic Sr fluxes from metamorphic carbonates in the Himalayan
region to bias our reconstruction of silicate weathering fluxes was previously
assessed by Li et al. and Li and Elderfield¹⁰. This sensitivity analysis was
conducted by modelling a linear increase in the isotopic composition of the Sr flux
from Tibetan rivers (from 0.7077 to 0.7081) over the past 40 Myr and partitioning
this flux in variable proportions between 100% carbonate and 100% silicate sources
to test the effects of increasing the input of radiogenic Sr from Tibetan rivers over
the Cenozoic on the reconstruction of weathering fluxes. Consideration of these
endmember scenarios minimally affected the reconstructed silicate weathering
fluxes.

To incorporate CO₂ release due to coupled sulphide oxidation and carbonate
dissolution into this model, modern rates of sulphide oxidation were taken to be
equal to 50% of the modern natural riverine S flux and kept in a constant
proportion to the total weathering flux of sedimentary rocks, which is calculated on the
basis of the mass balances of ⁸⁷Sr/⁸⁶Sr (equation (7)), ¹⁸⁷Os/¹⁸⁸Os (equation (6)),
δ¹³C and alkalinity, but is most sensitive to the ¹⁸⁷Os/¹⁸⁸Os mass balance. The net
weathering CO₂ drawdown associated with uplift (Fig. 3) is calculated by taking the
difference between the CO₂ consumption by silicate weathering and CO₂ release
from coupled carbonate dissolution and sulphide oxidation. This approach assumes
that a majority of sulphides are hosted in sedimentary rocks because it neglects
contributions from igneous sulphide minerals. If igneous sulphide oxidation is a
globally important process, our model provides an upper estimate of the net
weathering CO₂ drawdown associated with uplift because it underestimates the CO₂ flux
from sulphide oxidation.
Sulphur isotope mass balance model. To determine how the pyrite burial flux,
the δ³⁴S of riverine input, and the isotopic fractionation factor associated with
sulphate reduction and pyrite burial (Δ_pyrite-seawater) would have to vary to
reconcile the marine δ³⁴S record with the reconstruction of sedimentary rock
weathering from the Li and Elderfield¹⁰ model, the non-steady-state inverse-modelling
approach of ref. 31 was used. In this approach, the differential equation describing
the isotopic mass balance of S is

$$\frac{d\delta^{34}S}{dt} = \frac{F_{\mathrm{riverine}} \left( \delta^{34}S_{\mathrm{riverine}} - \delta^{34}S_{\mathrm{ocean}} \right) - F_{\mathrm{pyrite\ burial}}\,\Delta_{\mathrm{pyrite\text{-}seawater}}}{M_s} \qquad (8)$$

where dδ³⁴S/dt is the time rate of change of the isotopic composition of marine
sulphate, F_riverine is the input flux of S from rivers (10¹⁸ mol Myr⁻¹), δ³⁴S_riverine is
the isotopic composition of the riverine input, δ³⁴S_ocean is the isotopic composition
of marine sulphate, F_pyrite burial is the burial flux of S as pyrite (10¹⁸ mol Myr⁻¹),
Δ_pyrite-seawater is the average fractionation factor associated with sulphate reduction
and pyrite burial, and M_s is the mass of marine sulphate (10¹⁸ mol). We rearrange
equation (8) to calculate the time evolution of the parameter of interest. To
constrain the time variation of F_riverine, we use our reconstruction of sedimentary rock
weathering from the Li and Elderfield¹⁰ mass balance model (Fig. 3), assuming that
the total S yield (that is, evaporite plus sulphide) from sedimentary rock
weathering has remained constant and that the modern-day riverine sulphate flux is
equal to 3 × 10¹⁸ mol S Myr⁻¹ (ref. 16). By modelling the riverine input flux in this
manner, we are assuming that both evaporite and sulphide oxidation fluxes increased
in concert over the Cenozoic in response to tectonic uplift, which is geologically
reasonable because different sedimentary lithologies are interbedded on a large
scale. As a result, the isotopic composition of the riverine input does not have to
vary over the Cenozoic in our model. Nevertheless, the potential for variation in the
isotopic composition of the riverine input to influence the Cenozoic S cycle was
explored using the inverse-model calculations (see below).
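
As an illustration of what rearranging equation (8) means in practice, the Python sketch below solves the mass balance for the pyrite burial flux given the other terms. It is a minimal example rather than the authors' inverse model: the function name and the numerical inputs are hypothetical, fluxes are in units of 10¹⁸ mol Myr⁻¹, the sulphate mass is in 10¹⁸ mol, and isotopic values are in per mil.

```python
def pyrite_burial_flux(f_riv, d34s_riv, d34s_ocean, delta_pyr, mass,
                       dd34s_dt=0.0):
    """Rearrange the S isotope mass balance for the pyrite burial flux.

    From d(d34S)/dt = (F_riv*(d34S_riv - d34S_ocean) - F_pyr*Delta) / M,
    F_pyr = (F_riv*(d34S_riv - d34S_ocean) - M*d(d34S)/dt) / Delta.
    """
    return (f_riv * (d34s_riv - d34s_ocean) - mass * dd34s_dt) / delta_pyr


if __name__ == "__main__":
    # Hypothetical inputs: riverine flux 3, riverine d34S +5, ocean d34S +21,
    # fractionation -40, sulphate mass 40, and a constant ocean d34S.
    print(pyrite_burial_flux(3.0, 5.0, 21.0, -40.0, 40.0))
```

When the ocean composition is held constant (dδ³⁴S/dt = 0), the calculated burial flux scales linearly with the riverine input flux, and the constant of proportionality depends only on the isotopic terms.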

To constrain the time variation in the mass of marine sulphate, we use an
exponential fit to the fluid inclusion data from ref. 26, following ref. 31. To constrain the
time-dependent term dδ³⁴S/dt, we calculate the first derivative of a smoothing
spline fit of the δ³⁴S_barite record²⁷ after a 20-point robust loess smoothing of the raw
data. This fit to the δ³⁴S_barite record is also used for the value of δ³⁴S_ocean in the
model.

The time variations of the fractionation factor associated with sulphate
reduction and pyrite burial (Δ_pyrite-seawater), the S isotopic composition of riverine input
(δ³⁴S_riverine) and the pyrite burial flux (F_pyrite burial) are all poorly constrained. As a
result, it is necessary to perform separate model experiments in which two of the
parameter values are fixed at constant values and the third is calculated. In this
study, three types of model experiment were conducted: (1) variable pyrite burial
fluxes with constant δ³⁴S_riverine and Δ_pyrite-seawater, (2) variable Δ_pyrite-seawater with
constant F_pyrite burial and δ³⁴S_riverine, and (3) variable δ³⁴S_riverine with
constant F_pyrite burial and Δ_pyrite-seawater. To calculate the implied changes in the pyrite
burial flux (model type (1)), Δ_pyrite-seawater was varied between −50‰ and −30‰
following ref. 38 and the δ³⁴S_riverine input was varied between 0‰ and +10‰
following refs 27, 31. It is important to note that this range of δ³⁴S values for
the riverine input does not necessarily reflect the balance between evaporite and
sulphide sources, because each source displays large isotopic variability and
riverine sulphate may be fractionated by biological processes before being discharged
into the ocean. To calculate the implied changes in Δ_pyrite-seawater (model type (2)),
the δ³⁴S of riverine input was varied between 0‰ and +10‰ and the pyrite burial flux
was kept at a constant value of 0.22 × 10¹⁸, 0.67 × 10¹⁸ or 1.0 × 10¹⁸ mol S Myr⁻¹,
following refs 27, 31. To calculate the implied changes in the δ³⁴S of riverine input
(model type (3)), Δ_pyrite-seawater was varied between −50‰ and −30‰ and the
pyrite burial flux was kept at a constant value of 0.22 × 10¹⁸, 0.67 × 10¹⁸ or
1.0 × 10¹⁸ mol S Myr⁻¹. The constant pyrite burial fluxes greater than 0.22 × 10¹⁸ mol S Myr⁻¹
used in the model calculations of Δ_pyrite-seawater and δ³⁴S_riverine can be greater than
the riverine input flux reconstructed from the Li and Elderfield¹⁰ isotope mass
balance for a few time periods between 30 and 50 Myr ago. For this reason, the
model experiments for variable δ³⁴S of riverine input and Δ_pyrite-seawater were
evaluated only for the period 0-30 Myr ago.
Solving the sulphur mass balance model for variable pyrite burial flux. For the
model experiments where the mass balance equation was arranged to solve for the
pyrite burial flux with constant values for the δ³⁴S of riverine input and Δ_pyrite-seawater,
the calculated pyrite burial fluxes were directly proportional to the riverine input
flux. The constants of proportionality varied depending on the δ³⁴S of riverine
input and Δ_pyrite-seawater. Lower values of the δ³⁴S of riverine input and higher
values of Δ_pyrite-seawater produced higher pyrite burial fluxes for the same riverine
input flux (Extended Data Fig. 1). Multiple model experiments were consistent
with the long-term average burial flux estimated in ref. 40 from the pyrite content
of marine sediments (Extended Data Fig. 1). For the period around 45-50 Myr ago,
where there is a major positive excursion in the marine δ³⁴S record²⁷, some of the
model experiments produced pyrite burial fluxes greater than the total riverine
input flux (Extended Data Fig. 1). After this brief period, all of the model
experiments produced burial fluxes that were lower than the total riverine input flux
(Extended Data Fig. 1). For model experiments where the δ³⁴S of riverine input was
equal to 10‰, calculated pyrite burial fluxes were less than the estimated sulphide
oxidation fluxes used in the Li and Elderfield¹⁰ mass balance model (that is, 50% of
the total S flux) regardless of the value of Δ_pyrite-seawater (Extended Data Fig. 1). For
model experiments where the δ³⁴S of riverine input was equal to 5‰, calculated
pyrite burial fluxes were less than the estimated sulphide oxidation fluxes for
Δ_pyrite-seawater values < −30‰ (Extended Data Fig. 1).

For model experiments where the δ³⁴S of riverine input was equal to 0‰,
calculated pyrite burial fluxes were less than the estimated sulphide oxidation fluxes for
Δ_pyrite-seawater values < −40‰ (Extended Data Fig. 1). For reference, the value of
Δ_pyrite-seawater estimated from the average isotopic offset between coeval pyrite and
marine sulphate for the past 200 Myr is −43 ± 2‰ (ref. 38).

Solving the sulphur mass balance model for variable Δ_pyrite-seawater. For the
model experiments where the mass balance equation was arranged to solve for
Δ_pyrite-seawater with constant values for the pyrite burial flux and the δ³⁴S of riverine
input, higher δ³⁴S values of riverine input and lower burial fluxes resulted in lower
Δ_pyrite-seawater values for the same input flux (Extended Data Fig. 2). High burial
fluxes or high values of δ³⁴S_riverine, or both, were required to maintain Δ_pyrite-seawater
values above −60‰, which has been suggested to be the maximum extent of
isotopic fractionation during the Phanerozoic eon. All model experiments
produced values that were distinct from estimates based on the measured average
isotopic offset between coeval pyrite and marine sulphate for at least a portion of
the period from 0 to 30 Myr ago. From 0 to 12 Myr ago, each model experiment
showed decreasing Δ_pyrite-seawater values (Extended Data Fig. 2).

Solving the sulphur mass balance model for variable δ³⁴S_riverine. For the model
experiments where the mass balance equation was arranged to solve for the δ³⁴S of
riverine input with constant values for the pyrite burial flux and Δ_pyrite-seawater,
larger values of Δ_pyrite-seawater and higher burial fluxes resulted in lower δ³⁴S values
of riverine input for the same input flux (Extended Data Fig. 3). From 0 to 12 Myr
ago, model experiments where the burial flux was ≥0.67 × 10¹⁸ mol S Myr⁻¹ showed
increasing values of the δ³⁴S of riverine input (Extended Data Fig. 3). These
experiments also produced calculated δ³⁴S values of riverine input at 0 Myr ago near
estimates for the modern value.

Role of increased input fluxes of sulphur to the oceans. Increasing Cenozoic
marine sulphate concentrations²⁶ require variation in the input or output flux of
sulphate, or both. Because the variation of a single parameter within the S cycle
would result in a change in the isotopic composition of marine sulphate,
covariation of different parameters during the Cenozoic is necessary to reconcile the
relative constancy of the marine δ³⁴S record²⁷ with increasing marine sulphate
concentrations²⁶. The results presented here suggest that for a reasonable range of
parameter values, increasing marine sulphate concentrations during the Cenozoic
could result from increasing input fluxes from weathering if pyrite burial rates,
Δ_pyrite-seawater or the δ³⁴S of riverine input, or all three, covary with the input flux
(Extended Data Figs 1-3). Despite this agreement for some of the model
experiments, a mechanism for the covariation with the riverine input flux is required.

The factors that control the fractionation factor associated with sulphate
reduction and pyrite burial are incompletely understood, but Δ_pyrite-seawater is thought to
be positively correlated with sulphate concentrations (that is, higher [SO₄²⁻]
leads to less isotopic fractionation). This behaviour is inconsistent with our model
because a negative correlation between sulphate concentrations and Δ_pyrite-seawater
is necessary for the δ³⁴S record to be reconciled with increasing input fluxes by
changing only Δ_pyrite-seawater (Extended Data Fig. 2). Additionally, the model
calculations of Δ_pyrite-seawater are not in good agreement with independent estimates
from the sedimentary record. As a result, it is unlikely that variation in
Δ_pyrite-seawater alone can explain variations in the Cenozoic S cycle.

Model experiments where the burial flux of S was ≥0.67 × 10¹⁸ mol S Myr⁻¹
produced δ³⁴S_riverine values in agreement with independent estimates. In these
experiments, the relative constancy of the δ³⁴S of marine sulphate was partly
maintained in spite of increasing input fluxes through an increase in the δ³⁴S value of
riverine input from 0 to 12 Myr ago (Extended Data Fig. 3). This increase could
arise from a change in the source of sulphate in response to tectonic uplift. An
increase in the portion of sulphate derived from evaporite weathering from <50%
during a majority of the Cenozoic to the modern value of ~50% starting at ~12 Myr
ago would sufficiently increase the δ³⁴S value of riverine input. Such an increase in
evaporite weathering would mean that we have underestimated the effects of
sulphide oxidation in our isotope mass balance model (Fig. 3), because we assume
that the portion of riverine sulphate derived from sulphide oxidation remained at
the modern value for the entirety of the Cenozoic when it may have been greater
during the early Cenozoic.

It is also possible to increase the δ³⁴S value of riverine input without changing
the proportion of sulphide-derived sulphate, because sulphide minerals themselves
display a wide range of isotopic values depending on the depositional environment
and the geologic age. Similarly, a greater flux of sulphate from environments where
the microbial processing of riverine sulphate in groundwaters is important (for
example the Himalayan system) could produce an increase in the δ³⁴S value of
riverine input without changing the proportion derived from sulphide oxidation.

Increasing pyrite burial fluxes could also reconcile the Cenozoic marine δ³⁴S
curve with enhanced riverine fluxes and increasing sulphate concentrations (Extended
Data Fig. 1). This linkage is consistent with the known coupling between organic C
and pyrite burial fluxes and the importance of physical erosion in driving enhanced
rates of sulphide oxidation and organic C burial. However, a tight coupling
between riverine input and pyrite burial fluxes could potentially limit the timescale
of transient CO₂ release in response to sulphide oxidation. The sensitivity analysis
used in this study suggests that for a reasonable range of δ³⁴S values of riverine
input and Δ_pyrite-seawater (see above), sulphide oxidation fluxes can remain in excess
of pyrite burial fluxes throughout the Cenozoic, which is required for transient
CO₂ release (Extended Data Fig. 1). Although the magnitude of CO₂ release from
sulphide oxidation will depend on the difference between pyrite oxidation and
burial fluxes, the overall increase in CO₂ release during the Cenozoic is preserved.
This maintains the general trend of changes in the net weathering CO₂ sink from
tectonic uplift throughout the Cenozoic calculated with the Li and Elderfield¹⁰
mass balance model (Fig. 3).

Role of decreased pyrite burial fluxes. Apart from an increase in the riverine 
input flux, a long-term decrease in pyrite burial fluxes could also explain increasing
Cenozoic marine sulphate concentrations. Unlike increasing riverine fluxes in
response to tectonic uplift, it is unclear what processes could lead to a decrease
in pyrite burial fluxes and not affect the isotopic composition of marine sulphate.
However, it is important to note that decreasing pyrite burial fluxes over the
Cenozoic would lead to an imbalance in the S cycle and CO2 release into the
ocean-atmosphere system in the same manner as enhanced continental sulphide 
oxidation. 

Summary of the S cycle modelling. The plausible variation in the δ³⁴S of riverine
input and pyrite burial fluxes determined for a reasonable range of parameter
values by our model calculations supports the idea that sulphide oxidation was
enhanced during the Cenozoic in response to tectonic uplift. In addition to
providing a means of reconciling our hypothesis with the marine δ³⁴S record, these
results also help to explain the observed increase in marine sulphate
concentrations during the Cenozoic. Regardless of the specific driver, the modelled
scenarios produce conditions under which CO2 is released into the ocean-atmosphere
system in response to sulphide oxidation fluxes in excess of sulphide burial fluxes.

Determining the effect of enhanced sulphide oxidation on Cenozoic pO2. To
assess the trade-offs between O2 consumption and CO2 production implied by our
model, we considered a molar ratio of O2 consumption to CO2 production for pure
sulphide oxidation of 15/8, which is based on equations (1)-(3) and is consistent
with previous work quantifying the effect of sulphur redox transformations on
atmospheric oxygen. We have proposed that sulphide oxidation may have had
an important role in balancing enhanced CO2 consumption by silicate weathering
over the last 15 Myr (see, for example, Fig. 3), and so we considered the specific
case of the O2-CO2 trade-off over this time interval (Fig. 4). We converted the O2
consumption values to the required pO2 at 15 Myr ago by using a value of 37.8 × 10¹⁸ mol
for the present-day mass of O2 in the ocean-atmosphere system and treating the
atmosphere as an ideal gas (that is, equating the mole fraction with the volume
fraction). To compare these O2 consumption values with the CO2 fluxes, CO2
drawdown from silicate weathering was calculated for linear increases of 10%,
25% and 50% from 15 Myr ago to modern-day values. The range corresponding
to increases of 10-50% brackets the range of previous predictions and includes
the values implied by Fig. 3. A modern-day silicate weathering flux of
3.5 × 10¹⁸ mol Myr⁻¹ was calculated from the flux data of refs 41, 45 and the global
lithologic partitioning of ref. 41, assuming that 100% of the riverine silicate Ca²⁺
and Mg²⁺ flux contributed to CO2 drawdown and that 30% of the Na⁺ flux and 20% of
the K⁺ flux contributed after exchanging for Ca²⁺ in marine sediments (following the
approach of ref. 32, and as described above). The implied pO2 at 15 Myr ago, as
considered in this analysis, can be readily compared to the fire limit of 25% O2
suggested by ref. 28, although the experiments of ref. 29 suggest that the fire limit
is above 35% O2, and the modelling results of ref. 23 suggest a Phanerozoic maximum
value of ~35% O2.
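The drawdown bookkeeping described above can be sketched as follows. This is a minimal illustration rather than the authors' calculation: it assumes that an "increase of X%" means the modern flux is (1 + X) times the flux 15 Myr ago, and that the CO2 to be compensated is the cumulative drawdown above that initial baseline. The function name is hypothetical; the modern flux, the 15 Myr interval and the 15/8 ratio are the values quoted in the text.

```python
MODERN_SILICATE_FLUX = 3.5e18  # mol CO2 per Myr, as quoted in the text
O2_PER_CO2 = 15.0 / 8.0        # mol O2 consumed per mol CO2 released, from the text
DURATION_MYR = 15.0            # length of the interval considered


def extra_co2_drawdown(fractional_increase,
                       modern_flux=MODERN_SILICATE_FLUX,
                       duration=DURATION_MYR):
    """Cumulative CO2 drawdown (mol) above the initial baseline.

    Assumes the flux rose linearly and that the modern flux equals
    (1 + fractional_increase) times the flux at the start of the interval.
    """
    initial_flux = modern_flux / (1.0 + fractional_increase)
    # Area of the triangle between the rising flux and the constant baseline.
    return 0.5 * (modern_flux - initial_flux) * duration


for increase in (0.10, 0.25, 0.50):
    co2 = extra_co2_drawdown(increase)
    o2 = co2 * O2_PER_CO2  # O2 consumed if sulphide oxidation supplies all of it
    print(f"{increase:.0%} increase: extra CO2 = {co2:.2e} mol, "
          f"matching O2 consumption = {o2:.2e} mol")
```

A different reading of "increase" (for example, relative to the modern flux) would change the numbers somewhat but not the structure of the comparison.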

To determine the O2 consumption implied by the S cycle inverse models (Extended
Data Fig. 4), the net sulphur flux was calculated by subtracting the sulphide burial
flux from the sulphide oxidation flux at million-year intervals. The net sulphide
flux was integrated over the period from either 0 to 50 Myr ago or 0 to 15 Myr ago
after performing a cubic interpolation using MATLAB. The integrated value was
converted from moles S to moles O2 consumption using a ratio of O2 consumption
to sulphide oxidation of 15/8 (equation (1)). Implied Palaeocene and Miocene pO2
values were determined, as above, on the basis of a present-day O2 inventory in the
atmosphere of 37.8 × 10¹⁸ mol, and by treating the atmosphere as an ideal gas (that
is, equating mole fraction with the volume fraction). 
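The conversion from an integrated net sulphide flux to an implied past pO2 can be sketched as below. Several choices here are assumptions made for illustration and are not stated in the text: the trapezoidal rule stands in for the cubic interpolation done in MATLAB, the modern O2 mole fraction is taken as 0.2095, the inventory of all other atmospheric gases is held constant, and the flux series is hypothetical. The O2 inventory and the 15/8 ratio are the values quoted in the text.

```python
O2_PRESENT_MOL = 37.8e18         # present-day O2 inventory (mol), from the text
O2_MOLE_FRACTION_TODAY = 0.2095  # assumed modern atmospheric O2 mole fraction
O2_PER_S = 15.0 / 8.0            # mol O2 consumed per mol S oxidized, from the text


def integrate_trapezoid(times_myr, fluxes):
    """Integrate a flux series (mol per Myr) over time (Myr) by the trapezoidal rule."""
    total = 0.0
    for i in range(1, len(times_myr)):
        dt = times_myr[i] - times_myr[i - 1]
        total += 0.5 * (fluxes[i] + fluxes[i - 1]) * dt
    return total


def implied_past_po2(o2_consumed_mol):
    """Past O2 mole fraction if o2_consumed_mol has been removed since then.

    Assumes the inventory of all other gases has stayed constant, so that
    mole fraction can be equated with volume fraction.
    """
    other_gases = (O2_PRESENT_MOL * (1.0 - O2_MOLE_FRACTION_TODAY)
                   / O2_MOLE_FRACTION_TODAY)
    past_o2 = O2_PRESENT_MOL + o2_consumed_mol
    return past_o2 / (past_o2 + other_gases)


# Hypothetical net flux (oxidation minus burial), mol S per Myr, at 1-Myr steps.
ages_myr = list(range(0, 16))
net_flux = [0.2e18] * len(ages_myr)

net_s = integrate_trapezoid(ages_myr, net_flux)
o2_consumed = net_s * O2_PER_S
print(f"net S oxidized: {net_s:.3e} mol; O2 consumed: {o2_consumed:.3e} mol")
print(f"implied pO2 at 15 Myr ago: {100 * implied_past_po2(o2_consumed):.1f}%")
```

Negative integrated values (net sulphide burial) pass through the same functions and yield a past pO2 below the modern value.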

Implications of enhanced sulphide oxidation for Cenozoic pO2. Integrated over
the past 50 Myr, the O2 consumption implied by the results of the models with
variable sulphide burial varies from −26.8 × 10¹⁸ to 27.5 × 10¹⁸ mol O2, where
negative values reflect net O2 production (Extended Data Fig. 4a). Net oxygen
production is a consequence of calculated pyrite burial fluxes that are higher than the
sulphide oxidation fluxes used as model input (see above). Integrated over the past
15 Myr, O2 consumption varies from −9.9 × 10¹⁸ to 14.5 × 10¹⁸ mol O2 (Extended
Data Fig. 4b).

For all of the model experiments where pyrite burial fluxes were fixed at
constant values, O2 consumption integrated over the entire period from 0 to 50 Myr
ago is 34.1 × 10¹⁸, −7.9 × 10¹⁸ or −38.8 × 10¹⁸ mol O2 for respective burial fluxes
of 0.22 × 10¹⁸, 0.67 × 10¹⁸ and 1 × 10¹⁸ mol S Myr⁻¹ (Extended Data Fig. 4a).
Integrated over the period from 0 to 15 Myr ago, O2 consumption is 19.3 × 10¹⁸,
6.7 × 10¹⁸ or −2.6 × 10¹⁸ mol O2 for respective burial fluxes of 0.22 × 10¹⁸,
0.67 × 10¹⁸ and 1 × 10¹⁸ mol S Myr⁻¹ (Extended Data Fig. 4b).

The results of the S cycle model imply changes in atmospheric O2. Although
they notably do not include consideration of the organic C cycle, which may also
influence pO2, the results of the S cycle model imply Palaeocene pO2 values of 23-33%
for the combinations of parameter values that produce net O2 consumption when
integrated over the entire period from 0 to 50 Myr ago (Extended Data Fig. 4a).
Arguably, the combinations of parameter values that produce net O2 consumption
together with CO2 production (that is, a δ³⁴S of riverine input >0‰ and
Δpyrite-seawater = −40) are the most geologically reasonable given that the δ³⁴S of
riverine input is thought to be between +3 and +8‰ (refs 40, 46), and the average
Δpyrite-seawater is thought to be −43 ± 2‰ (ref. 38).

Considering only the past 15 Myr, which is when the Li and Elderfield model
suggests that sulphide oxidation acted to balance enhanced CO2 consumption as a
result of silicate weathering, the parameter combinations that produce net
sulphide oxidation imply Miocene pO2 values of less than 30% and would produce
sufficient CO2 to balance plausible increases in silicate weathering fluxes (Extended
Data Fig. 4b). This is consistent with the analysis described in Fig. 4.

Extended Data Figure 1 | Riverine input fluxes and calculated pyrite burial
fluxes from 0 to 50 Myr ago for constant δ³⁴S values of riverine input and
Δpyrite-seawater. Solid and dashed black lines indicate the total and
pyrite-derived input fluxes of sulphate from the Li and Elderfield model, respectively.
The green band indicates the range of calculated pyrite burial fluxes for
experiments where the δ³⁴S value of riverine S flux was equal to 10‰ and
Δpyrite-seawater was varied between −30 and −50‰. The pink band indicates the
range of calculated pyrite burial fluxes for experiments where the δ³⁴S value
of riverine S flux was equal to 5‰ and Δpyrite-seawater was varied between
−30 and −50‰. The blue band indicates the range of calculated pyrite burial
fluxes for experiments where the δ³⁴S value of riverine S flux was equal to 0‰ and
Δpyrite-seawater was varied between −30 and −50‰. The khaki bar indicates the
range of pyrite burial fluxes estimated from the pyrite content of marine
sediments by ref. 40.

Extended Data Figure 2 | Calculated Δpyrite-seawater values from 0 to 30 Myr
ago for constant δ³⁴S values of riverine input and pyrite burial fluxes. The
solid and dashed black lines indicate the average and minimum Δpyrite-seawater
values for the Phanerozoic based on the isotopic offset between coeval
marine sulphate and sulphide minerals. The thickness of the solid line
reflects 1 s.d. of the average Δpyrite-seawater from ref. 38. The green band indicates
the range of calculated Δpyrite-seawater values for experiments where the pyrite
burial flux was 1 × 10¹⁸ mol S Myr⁻¹ and the δ³⁴S value of riverine input was
varied between 0 and 10‰. The blue band indicates the range of calculated
Δpyrite-seawater values for experiments where the pyrite burial flux was
0.67 × 10¹⁸ mol S Myr⁻¹ and the δ³⁴S value of riverine input was varied between
0 and 10‰. The brown band indicates the range of calculated
Δpyrite-seawater values for experiments where the pyrite burial flux was
0.22 × 10¹⁸ mol S Myr⁻¹ and the δ³⁴S value of riverine input was varied between
0 and 10‰.

Extended Data Figure 3 | Calculated δ³⁴S values of riverine input from 0 to
30 Myr ago for constant Δpyrite-seawater values and pyrite burial fluxes. The
solid black line on the vertical axis indicates the range of estimates for the
... −50‰. The blue band indicates the range of calculated δ³⁴S values of riverine
input for experiments where the pyrite burial flux was 0.67 × 10¹⁸ mol S Myr⁻¹ ...

Extended Data Figure 4 | Integrated O2 consumption and CO2 production
implied by the S cycle inverse model for different input parameters. The O2
consumption values are presented in molar units. Importantly, the implied
pO2 values ignore other processes affecting the O2 budget (for example changes
in the organic C cycle). Negative values of both CO2 and O2 consumption
reflect net sulphide burial. Independent estimates of the δ³⁴S of riverine
input (refs 40, 46) and the average Δpyrite-seawater (ref. 38) correspond to the
parameter combinations that produce net sulphide oxidation (that is, positive
values) and are marked with stars. a, The model results integrated from 0 to 50 Myr ago.
b, The model results integrated from 0 to 15 Myr ago following the results of the
Li and Elderfield model, which suggests that sulphide oxidation acted at least
in part to balance enhanced CO2 consumption over this time period. The
dashed horizontal lines represent the CO2 release required to compensate fully
for a 10% (blue), 25% (purple) or 50% (orange) increase in continental silicate
weathering that occurred linearly over the past 15 Myr assuming a modern flux
of 3.5 × 10¹⁸ mol Myr⁻¹ (Methods). The 25% increase corresponds to the
increase predicted by the Li and Elderfield mass balance model.

Extended Data Table 1 | CO2 consumption and release rates in modern river systems


Melanosome evolution indicates a key physiological
shift within feathered dinosaurs


Quanguo Li, Julia A. Clarke, Ke-Qin Gao, Chang-Fu Zhou, Qingjin Meng, Daliang Li, Liliana D’Alba & Matthew D. Shawkey


Inference of colour patterning in extinct dinosaurs has been based
on the relationship between the morphology of melanin-containing
organelles (melanosomes) and colour in extant bird feathers. When
this relationship evolved relative to the origin of feathers and other
novel integumentary structures, such as hair and filamentous body
covering in extinct archosaurs, has not been evaluated. Here we
sample melanosomes from the integument of 181 extant amniote
taxa and 13 lizard, turtle, dinosaur and pterosaur fossils from the
Upper Jurassic and Lower Cretaceous of China. We find that in the
lineage leading to birds, the observed increase in the diversity of
melanosome morphologies appears abruptly, near the origin of pinnate
feathers in maniraptoran dinosaurs. Similarly, mammals show
an increased diversity of melanosome form compared to all ectothermic
amniotes. In these two clades, mammals and maniraptoran
dinosaurs including birds, melanosome form and colour are linked
and colour reconstruction may be possible. By contrast, melanosomes
in lizard, turtle and crocodilian skin, as well as the archosaurian
filamentous body coverings (dinosaur ‘protofeathers’ and pterosaur
‘pycnofibres’), show a limited diversity of form that is uncorrelated
with colour in extant taxa. These patterns may be explained by convergent
changes in the key melanocortin system of mammals and birds,
which is known to affect pleiotropically both melanin-based colouration
and energetic processes such as metabolic rate in vertebrates,
and may therefore support a significant physiological shift in
maniraptoran dinosaurs.

Melanin-based colour is a ubiquitous feature of amniote integument,
found in feathers of every major bird clade, as well as in skin and hair
(the scales covering skin in reptiles are typically transparent). We
examined the relationship between integumentary structure and melanosome
morphology in a phylogenetic context. Individual melanosomes
were measured from scanning electron microscope (SEM) images of
the hairs of 44 species of extant mammals (n = 51 samples) and the skin
of 36 extant species (n = 36 samples), sampling across Lepidosauria,
Testudines and Crocodylia (Supplementary Tables 1-3). These measurements
were compared with our previously published data from
101 extant avian species (n = 168 feather samples). Samples targeting
the full range of melanin-based colours for each integument type included
blacks, browns and greys for feathers, hair and skin. As feathers uniquely
show a broad range of melanin-based iridescent colours, and these are
known to be associated with distinctive melanosome morphologies,
they were also included. Although iridescent colours are found in reptile
skin and (rarely) in hair, they are produced through light scattering
from iridophores and multilayer keratin films, respectively. As these
colours are not melanin-based, the integuments in which they are produced
were not sampled. Capturing melanosome diversity fully also
required inclusion of morphotypes associated with black colour that
are so far known only from extant penguins.

Preserved integument from 13 fossil amniotes from the Upper Jurassic
and Lower Cretaceous of northeast China was sampled using described
protocols (Supplementary Methods). Skin was sampled from two
fossil lepidosaurs, a turtle, and two specimens of the ornithischian
dinosaur Psittacosaurus (Extended Data Figs 1-3 and Supplementary
Tables 2 and 3). Although extant amniote epidermal appendages are
limited to scales, hairs and feathers, additional structures are observed
in the fossil record. These include basally bunched or single hollow filaments
in theropod dinosaurs proposed to be homologous with modern
feathers, bristles and filaments in ornithischian dinosaurs, and
pterosaur pycnofibres, the identity and homology of which are more
controversial. Filamentous structures were sampled from the theropod
dinosaur Beipiaosaurus and two pterosaurs (Extended Data Figs 4, 5 and
Supplementary Tables 2, 3). Feathers were sampled from Caudipteryx,
Confuciusornis, one ornithurine and two enantiornithine birds (Extended
Data Figs 2, 6-8 and Supplementary Tables 2 and 3). Published SEM
images and data were assessed for filamentous structures in the
non-maniraptoran coelurosaurian dinosaur Sinosauropteryx, feathers from
the basal paravians Anchiornis, Microraptor and Archaeopteryx and
the extinct penguin Inkayacu. We used factorial analyses of variance
(ANOVAs) and Tukey HSD (honestly significant difference) post-hoc
tests in the R programming language to test for differences in melanosome
morphology by colour and integument type in extant samples
(Fig. 1, Supplementary Methods and Extended Data Fig. 9). Canonical
discriminant analysis (CDA) assessed the accuracy of the colours predicted
for extant skin and hair samples based on melanosome morphology
(Supplementary Methods).

Figure 1 | Melanosome length observed in extant feathers, lepidosaur,
testudine and archosaur skin, and mammalian hair. Boxplot colours
correspond with integument colour: black, brown and grey. For feathers,
‘penguin-like’ is shown in blue, and iridescent is shown in purple. Lines
within boxes are median values, boxes are quartiles and outer lines are ranges.
Boxplots sharing the same letter (v, w, x, y, z) are not significantly different
(two-sided Tukey HSD; P < 0.05) from one another; melanosome shape correlates
with distinct colours in feathers and hair but not in skin. Extant feathers,
n = 168; lepidosaur, testudine and archosaur skin, n = 36; mammalian hair, n = 51.

Across extant Amniota, the range of melanosome shape and form
differs between taxa and integumentary types (Figs 1-4). These differences
are directly related to the ability to predict colour from melanosome
morphology (Supplementary Methods). For example, unlike in
extant bird feathers, extant lepidosaur, turtle and crocodilian skin
colour cannot be predicted from morphology alone (Fig. 1; Supplementary
Methods). Diversity of melanosome morphology in these taxa is
limited (Figs 1 and 2a, b) compared to mammals and birds (Figs 2c-f
and 4). Although CDAs of extant feathers classify colours with
82% accuracy based on morphology, no two colours of skin could be
discriminated. Indeed, because no morphological variable was correlated
with colour in extant skin, none could be used in CDA (Supplementary
Methods). Overall, melanosome length was more variable than diameter
within and among amniote integument types. For hair (Fig. 2c, d), a
CDA with melanosome length as a variable weakly predicted colour
when grey hairs were included (59% accuracy), but predicted colour
well (87% accuracy) when they were excluded (Supplementary Methods).
This is likely because, in contrast to grey feathers, grey hair is typically
produced by macroscale patterns of alternating dark and light hairs or
striping within single hairs, rather than associated with distinct melanosome
shapes (Supplementary Methods).

Figure 2 | SEM images of melanosomes from extant taxa representing the
lowest-aspect-ratio and highest-aspect-ratio forms for each integumentary
type. a, b, Skin of leaf-tailed gecko, Uroplatus fantasticus (a), and Galapagos
fire lizard, Microlophus albemarlensis (b). c, d, Hair from hairy-tailed mole,
Parascalops breweri (c), and domestic cat, Felis catus (d). e, f, Feathers from
tufted titmouse, Baeolophus bicolor (e), and Huon astrapia, Astrapia rothschildi
(f). Scale bars, 1 µm.

Only two shifts in integumentary structure in amniotes, the origin of
pinnate feathers and mammalian hairs, are associated with shifts in
melanosome diversity (Fig. 4). CDA indicates that colour and melanosome
morphology are linked in the extant members of these clades (Supplementary
Methods). By contrast, no similar shift is observed in the
transition to filamentous integument in archosaurs or in the transition
from single hollow filaments to branched and basally joined bundles of
filaments in Coelurosauria (Fig. 4). If the structures in pterosaurs,
ornithischian dinosaurs and theropods represent three separate origins of
filamentous structures instead of one early in Archosauria, this pattern
would be even stronger. Observed melanosome size and variation were
similar in extinct and extant taxa of each major amniote group and for
each integumentary type (for example, extant and fossil skin) (Figs 2-4).
Accounting for proposed taphonomic alteration of fossil melanosome
size (that is, average 18-20% reduction in size) does not affect recovered
patterns (Extended Data Fig. 10; compare with Fig. 4). Indeed, fit
of sampled fossil lepidosaur, testudine and archosaur melanosomes
with extant taxa argues against significant taphonomic alteration (see
also Supplementary Methods).

Figure 3 | Melanosomes from Jehol Group fossils. a-h, Images show the
differences between low-aspect-ratio melanosomes consistently observed in
lepidosaur skin (PKUP V1059; a, b) and filamentous structures in pterosaurs
(BMNHC PH000988; c, d); and between non-maniraptoran dinosaurs
(Beipiaosaurus; BMNHC PH000911; e, f) and the high-aspect-ratio forms
observed in maniraptoran feathers (undescribed ornithurine bird, CUGB
G20100053; g, h). Dots in a, c, e and g indicate the location of the samples
shown in b, d, f and h. Dot colours correspond to the colours in Fig. 4: light
green, extinct lepidosaurian, testudine and crocodilian skin; dark brown,
filamentous body covering in dinosaurs; light brown, pterosaurs; orange,
feathers in extinct stem avialan taxa. See Extended Data Figures for complete
sampling maps. Scale bars, 1 µm.

Figure 4 | Melanosome diversity across Amniota. Scatterplots of individual
melanosome measurements (diameters and lengths) by integument type for
extant mammal hair (blue, n = 719), skin from extant (dark green, n = 742)
and extinct (light green, n = 925) lepidosaurian, testudine and archosaurian
species, filamentous body covering in pterosaurs (light brown, n = 899),
filamentous body covering in dinosaurs (dark brown, n = 164), feathers in
basal Paraves (yellow, n = 1,268), Confuciusornis and crown-ward extinct
avialan taxa (orange, n = 1,683), extant Aves (bright red, n = 3,294) and
flightless palaeognath birds (dark red, n = 107). Colours of silhouettes
correspond with colours in scatterplots. Black indicates unsampled taxa or
integumentary type (for example, bristle structures on the tail of
Psittacosaurus). Grey regions in scatterplots indicate the extent of melanosome
diversity in all samples (total observed melanosome morphospace), and
regression lines are drawn for comparison among integumentary types. Shaded
areas around lines indicate 95% confidence intervals.

Pterosaurs and non-maniraptoran dinosaurs show a limited range of
low-aspect-ratio (length:width, <2) melanosome morphologies (Figs 3c-f
and 4) similar to the morphospace occupied by fossil and extant
lepidosaurs, crocodylians and turtles (Figs 3a, b and 4). Low-aspect-ratio
melanosomes in Sinosauropteryx filaments have been interpreted as
indicative of a reddish brown colour based on avian comparisons. By
this logic the newly sampled pterosaurs and Beipiaosaurus (Fig. 3c-f
and Extended Data Figs 4 and 5) would all be inferred as similarly brown
(avian CDA; Supplementary Methods). However, plesiomorphically,
low-aspect-ratio melanosomes from skin (Fig. 2a, b) produce black, brown or
grey colours, and there is no evidence that derived avian relationships
between shape and colour apply to more basal taxa occupying a similar
(outgroup) melanosome morphospace (Fig. 1, Extended Data Fig. 9 and
Supplementary Table 2). We therefore caution against colour reconstruction
for these archosaurian taxa based on melanosome morphology.

Melanosome diversity in Amniota does not track novelties in the material
of which these integumentary structures are composed, previously
proposed differences in melanin chemistry, or the range of colours
produced (Fig. 4). In terms of novelties in material, novel β-keratins
arose in both Lepidosauria and Testudines, some are shared by birds
and crocodilians, and some are unique to birds. However, only those
unique to birds may be associated with observed changes in the melanin
system. Mammalian hair consists of structurally distinct α-keratin
but exhibits melanosome diversity approaching that in birds (Figs 2
and 4). In terms of previously proposed differences in melanin chemistry,
although phaeomelanin was long thought to be exclusive to mammals
and birds, it has recently been reported in amphibians and
turtles, and thus its distribution does not explain the recovered differences
in melanosome diversity. Finally, in terms of the range of
colours produced, colouration is a key function of melanin in amniotes
and the range of colours across our sampled taxa varies markedly.
However, melanosome diversity in amniotes is not predicted by how
colourful the extant members of these taxa are today. Lepidosaurs show
low melanosome diversity and brightly coloured species make use of
other colour-producing mechanisms. Mammals generally exhibit more
muted colours than those observed in birds and lack structural colouration
other than a weak iridescence produced by a keratin multilayer in
golden moles. However, they show a striking diversity of melanosome
morphologies, most similar to the range observed in birds (Fig. 4).

Shifts in the melanin-based colour system in amniotes may instead 
be explained by physiological innovations. Convergent changes in the 
melanin-based colour system towards an expanded range of melanosome 
lengths, diameters and aspect ratios occur in maniraptoran dinosaurs 
and in mammals (Figs 2-4). Interactions between the agouti signalling 
protein (ASIP) and the melanocortin 1 receptor (MC1R) have been 
implicated in shifts in melanin synthesis in developing feathers as
well as hair. Within-feather patterning, such as stripes, results from
these interactions and first appears within maniraptoran dinosaurs
coincident with the increase in melanosome diversity noted here (Fig. 4).
The changes we observe are thus consistent with independent shifts in
at least these axes of the melanocortin system in the lineages leading to
mammals and birds.

The melanocortin system pleiotropically affects both melanin-based
colouration and key energetic processes such as metabolic rate,
reproductive physiology, stress axis and food intake across vertebrates
(from bony fish to humans). In Aves, inter- and intraspecific variation
in melanic colour has been linked to differences in energetics and
metabolism, and a possible role of convertase enzymes PC1/3 and
PC2 was recently implicated in pleiotropic effects. ASIP also has
well-characterized pleiotropic effects, and in particular the ASIP-MC1R
interaction was recently recognized to have a physiological as well as
pigmentary role in birds. Mirroring the broad patterns between
ecto- and endothermic animals, our results show that avian taxa that exhibit
lower basal metabolic rates, flightless palaeognaths, also exhibit lower
melanosome diversity (Fig. 4). Melanosome diversity in endothermic
extant amniotes may be similar, despite profound differences
in keratin chemistry, integumentary structure and melanocortin
pathways, because it is pleiotropically
linked to changes in energetics associated with the higher metabolic
rates they uniquely share. Although further investigation is clearly
required, the change in melanin-based colour reported here, and the
origin of within-feather melanin-based colouration in Maniraptora,
may indicate when a key shift in dinosaurian physiology occurred before
the origin of flight.


METHODS SUMMARY 


We obtained data on melanosome morphology from extant hair and skin samples
using techniques previously described for feathers. In brief, hair and skin samples
were embedded in plastic and sectioned longitudinally with a glass knife on an
ultramicrotome. These sections were viewed on a JEOL JSM7401F SEM. The
image-processing program ImageJ (available for download at http://rsbweb.nih.gov/ij/)
was used to measure length, diameter, aspect ratio and density in the SEM images
acquired. We used ANOVA and Tukey post-hoc tests to compare these variables
between integumentary types, and a canonical discriminant analysis to determine
whether colour of skin or hair could be predicted from these morphological
variables. We took small (approximately 1 mm²) samples from a range of integumentary
structures in the fossil record, including feathers, skin, ‘pycnofibres’ and
‘protofeathers’, and viewed them on a ZEISS SUPRA-55 VP SEM. Melanosome
morphology data were collected from these samples using ImageJ. We visualized all
data using the ggplot2 package in R.


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper.

METHODS


Samples of extant amniote integument were used to capture the diversity of
melanosomes associated with melanin-based colours found in each clade (see
Supplementary Methods for details). Extant hairs were pulled from study specimens of
mammals in the collections of the University of Akron and the San Diego Museum
of Natural History, and small (approximately 5 mm²) sections of skin were cut
from melanized portions near the forelimb of formalin-preserved reptiles in the
collection of the Smithsonian Museum of Natural History. Sampled species are
listed in Supplementary Table 2. We obtained data on melanosome morphology
from these samples as described previously. First, skin or hair was embedded in
Epon by dehydration using 100% ethanol (20 min) twice, and by infiltration with
15, 50, 70 and 100% Epon (24 h each step). Infiltrated samples were then placed in
block moulds and polymerized at 60 °C for 16 h. We then cut thick (5 µm)
longitudinal sections of blocks with a glass knife on a Leica UC-6 ultramicrotome,
mounted them on stubs with carbon tape, sputter-coated them with silver and viewed
them on a JEOL JSM7401F SEM.

From the resulting SEM images we used the image-processing program ImageJ
(available for download at http://rsbweb.nih.gov/ij/) to measure melanosome
morphology as described previously. We measured maximum linear short and long
axis length of melanosomes that were oriented perpendicular to line of sight, and
from these data calculated aspect ratio (long:short axis). These ratios are an index
of shape, and values close to 1 indicate sphericity, whereas values further from 1
indicate cylindricity. The distribution of melanosome morphology within feathers
was frequently skewed towards one type of morphology; we therefore also
calculated the skewness of the long and short axes. We calculated density as the number
of melanosomes per square micron of integumentary surface area. Using the R
environment, we compared these values between extant coloured integument using
ANOVA (aov function) and Tukey’s post-hoc test (TukeyHSD function). Results
are summarized in Fig. 1 and Supplementary Table 1.
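The shape and density measures defined above are simple to compute once axis lengths have been measured. The following sketch is illustrative only and is not the authors' R workflow: the function names and the sample measurements are hypothetical, and skewness is taken here as the population moment coefficient (third central moment divided by the 1.5 power of the second), which is one of several common definitions.

```python
from statistics import mean


def aspect_ratio(long_axis, short_axis):
    """Long:short axis ratio; values near 1 indicate a near-spherical melanosome."""
    return long_axis / short_axis


def skewness(values):
    """Population moment coefficient of skewness (one common definition)."""
    n = len(values)
    centre = mean(values)
    m2 = sum((v - centre) ** 2 for v in values) / n
    m3 = sum((v - centre) ** 3 for v in values) / n
    if m2 == 0:
        return 0.0  # all values identical: no asymmetry to report
    return m3 / m2 ** 1.5


def density(melanosome_count, area_um2):
    """Melanosomes per square micron of integumentary surface area."""
    return melanosome_count / area_um2


# Hypothetical long- and short-axis lengths (nm) for a handful of melanosomes.
long_axes = [820.0, 910.0, 760.0, 1500.0, 880.0]
short_axes = [310.0, 290.0, 300.0, 280.0, 320.0]

ratios = [aspect_ratio(l, s) for l, s in zip(long_axes, short_axes)]
print("mean aspect ratio:", round(mean(ratios), 2))
print("long-axis skewness:", round(skewness(long_axes), 2))
print("density (per square micron):", density(42, 20.0))
```

Per-sample summaries of this kind would then be the inputs to the ANOVA and post-hoc comparisons described in the text.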

We used a quadratic canonical discriminant analysis to determine whether col- 
our could be predicted from melanososome morphology in hair and skin. We used 
a quadratic rather than linear discriminant analysis because some variables showed 
evidence of collinearity, but use of linear discriminant analysis did not signi- 
ficantly alter results. We used a forward stepwise method to choose variables 
that contributed significantly (P < 0.05) to the analysis and to eliminate those
that did not. 

We sampled fossils with preserved integument spanning the range of known 
forms, including skin, ‘protofeathers’, ‘pycnofibres’ and pinnate feathers. We col- 
lected morphological data from SEM images of fossil integument from the Jehol 
biota in the same manner as the extant samples. Small (approximately 1 mm²)
samples were taken from preserved integument of fossils as before’. Each specimen 
was sampled as thoroughly as possible to maximize the potential diversity of mela- 
nosomes while maintaining the integrity of the fossil. The samples were coated 
with silver (30 s) and studied with either a ZEISS SUPRA-55 VP field emission scan- 
ning electron microscope (situated at China University of Geosciences, Beijing) or 
the JEOL environmental scanning electron microscope used for extant samples. 
Morphological measurements from melanosomes preserved as imprints and in 
three dimensions were assembled in the same manner as the modern samples. 
There was no taxonomic variation in the frequency of distinct preservational 
styles. These data are summarized in Fig. 3 and Supplementary Table 3.

Extended Data Figure 1 | Sampling map of an unnamed turtle fossil, PKUP
V1070 (top) and a lizard fossil, Yabeinosaurus sp. PKUP V1059 (bottom).
Numbers indicate integument sampling sites; melanosome data from all
sites are presented in Fig. 4 and pooled for the per-taxon values given in
Supplementary Table 2.

Extended Data Figure 2 | Sampling map of a gliding lizard fossil,
Xianglong zhaoi, PMOL 000666, counterpart (top) and the basal avialan,
Confuciusornis sanctus, CUGB G20070001 (bottom). Integument sampling
sites are numbered. Numbers indicate integument sampling sites; melanosome
data from all sites are presented in Fig. 4 and pooled for the per-taxon values
given in Supplementary Table 2.

Extended Data Figure 3 | Skin sample from the ornithischian dinosaur,
Psittacosaurus lujiatunensis, PKUP V1050 (top) and from the ornithischian
dinosaur, Psittacosaurus lujiatunensis, PKUP V1051 (bottom). Numbers
indicate integument sampling sites; melanosome data from all sites are
presented in Fig. 4 and pooled for the per-taxon values given in Supplementary
Table 2.


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 




Extended Data Figure 4 | Sampling map for two pterosaurs, PMOL
AP00022 (top) and BMNHC PH000988 (bottom). Numbers indicate
integument sampling sites; melanosome data from all sites are presented in
Fig. 4 and pooled for the per-taxon values given in Supplementary Table 2.

Extended Data Figure 5 | Samples from filaments preserved in the neck
region of a skeleton of the theropod dinosaur, Beipiaosaurus, BMNHC
PH000911 (counter slab). Numbers indicate integument sampling sites;
melanosome data from all sites are presented in Fig. 4 and pooled for the
per-taxon values given in Supplementary Table 2.

Extended Data Figure 6 | Sampling map of the feathered maniraptoran
dinosaur, Caudipteryx zoui, PMOL AD00020. Numbers indicate integument
sampling sites; melanosome data from all sites are presented in Fig. 4 and
pooled for the per-taxon values given in Supplementary Table 2.

Extended Data Figure 7 | Sampling map of (top) an unnamed
enantiornithine bird, CUGB G20120001 and (bottom) an undescribed
enantiornithine bird, CUGB P1201. Numbers indicate integument sampling
sites; melanosome data from all sites are presented in Fig. 4 and pooled for the
per-taxon values given in Supplementary Table 2.

Extended Data Figure 8 | Sampling map of an undescribed ornithurine
bird, CUGB G20100053. Numbers indicate integument sampling sites;
melanosome data from all sites are presented in Fig. 4 and pooled for the
per-taxon values given in Supplementary Table 2.

Extended Data Figure 9 | Melanosome diameters and aspect ratios observed
in extant feathers, lepidosaur, testudine and archosaur skin, and
mammalian hair. Melanosome diameters are shown in a, and aspect ratios are
shown in b. Boxplot colours correspond with integument colour: black, brown,
grey. For feathers, ‘penguin-like’ is shown in blue and iridescent is shown in
purple. Lines are median values, boxes are quartiles, lines are range. Boxplots
sharing the same letter (v, w, x, y, z) are not significantly different from one
another.

Extended Data Figure 10 | Exploration of the potential effects of
taphonomy and sampling on the observed differences in melanosomes in 
skin, hair, filaments and feathers. Top, melanosome diversity is adjusted to 
model taphonomic shrinkage of melanosomes suggested from experimental 
studies. Values for all fossil samples were adjusted (enlarged by 20%) based on 
the findings of ref. 18 (Supplementary Methods). Original data points are 
shown in colours and adjusted data are shown in grey. Grey regions indicate the 
extent of the total melanosome morphospace from the primary analyses. 

The pattern reported (i.e., increased diversity and higher-aspect-ratio forms
only in Maniraptora and Mammalia) is not affected (Fig. 4, main text; n for each 
integumentary type is identical to the primary analysis). Bottom, to consider 
the effect of sampling on the observed pattern, samples from near the thoracic
region were removed from the database. Although there was no evidence to 
suggest that these samples were from internal organs, or that such organs 
were preserved, because melanosomes are present in some internal organs in 
extant taxa, the sensitivity of the results to removal of these samples was explored.
There was no effect on the pattern reported from the primary analysis (compare
Fig. 4). Samples 1 and 2 from Yabeinosaurus sp. (PKUP V1059), sample 1 from 
Psittacosaurus lujiatunensis (PKUP V1050), samples 121, 122 and 123 from 
Caudipteryx zoui (PMOL AD00020), samples 11, 13 and 14 from an 
undescribed enantiornithine (CUGB P1201), samples 30 and 33 from an 
undescribed enantiornithine (CUGB G20120001), and samples 64-69, 89 
and 90 from an undescribed ornithurine bird (CUGB G20100053) were 
removed from the database. Samples are colour coded as in Fig. 4, main text 
and adjusted n for subsampling analysis follows: extant mammal hair (blue, 
n = 719), skin from extant (dark green, n = 742) and extinct (light green,
n = 605) lepidosaurian, testudine and archosaurian species, feathers in basal
Paraves (yellow, n = 1,212), Confuciusornis and crown-ward extinct avialan 
taxa (orange, n = 1,376), extant Aves (bright red, n = 3,294) and flightless 
palaeognath birds (dark red, n = 107). Colours of silhouettes correspond with 
colours in scatterplots. Black indicates unsampled taxa or integumentary type
(e.g., bristle structures on the tail of Psittacosaurus).

Genomic studies have shown that Neanderthals interbred with modern
humans, and that non-Africans today are the products of this mixture.
The antiquity of Neanderthal gene flow into modern humans means 
that genomic regions that derive from Neanderthals in any one human 
today are usually less than a hundred kilobases in size. However, Nean- 
derthal haplotypes are also distinctive enough that several studies 
have been able to detect Neanderthal ancestry at specific loci’? *. 
We systematically infer Neanderthal haplotypes in the genomes of 
1,004 present-day humans’. Regions that harbour a high frequency 
of Neanderthal alleles are enriched for genes affecting keratin fila- 
ments, suggesting that Neanderthal alleles may have helped modern 
humans to adapt to non-African environments. We identify multiple 
Neanderthal-derived alleles that confer risk for disease, suggesting 
that Neanderthal alleles continue to shape human biology. An unex- 
pected finding is that regions with reduced Neanderthal ancestry are 
enriched in genes, implying selection to remove genetic material 
derived from Neanderthals. Genes that are more highly expressed 
in testes than in any other tissue are especially reduced in Neander- 
thal ancestry, and there is an approximately fivefold reduction of 
Neanderthal ancestry on the X chromosome, which is known from 
studies of diverse species to be especially dense in male hybrid ster- 
ility genes’”"”. These results suggest that part of the explanation for 
genomic regions of reduced Neanderthal ancestry is Neanderthal 
alleles that caused decreased fertility in males when moved to a modern 
human genetic background. 

To search systematically for Neanderthal haplotypes, we developed 
a method based on a conditional random field’* (CRF) that combines 
information from three features of genetic variation that are inform- 
ative of Neanderthal ancestry (Supplementary Information section 1 
and Extended Data Fig. 1). The first is the allelic pattern at a single 
nucleotide polymorphism (SNP): if a non-African individual carries a 
derived allele seen in Neanderthals but absent from the west-African 
Yoruba from Ibadan, Nigeria (YRI), the allele is likely to originate from 
Neanderthals. The second is high sequence divergence of the non- 
African haplotype to all YRI haplotypes but low divergence to Nean- 
derthal. The third is a haplotype length consistent with interbreeding 
37-86-thousand years ago’*. We trained the CRF using simulations”, 
and established its robustness to deviations from the assumed demo- 
graphy (Supplementary Information section 2). 

We screened for Neanderthal haplotypes in the 1000 Genomes 
Project Phase 1 (1KG) data’, using the Altai Neanderthal genome of 
52-fold average coverage to determine alleles present in Neanderthals’, 
a six-primate consensus to determine ancestral alleles’®, and 176 YRI 
genomes as a reference panel assumed to harbour no Neanderthal 
ancestry (Fig. 1a). Table 1 reports the mean and standard deviation
across individuals of the fraction of their ancestry confidently inferred 
to be Neanderthal (probability > 90%). Figure 1b and Extended Data 
Fig. 2 plot the fraction of European (n = 758) and east-Asian (n = 572) 
haplotypes that descend from Neanderthals at each genomic location 
(Supplementary Information section 3). We created a tiling path of
inferred Neanderthal haplotypes that spans 1.1 gigabases (Gb) over
4,437 contigs (Supplementary Information section 4), thus filling in 
gaps in the Neanderthal sequence over a number of repetitive regions 
that cannot be reconstructed from short ancient DNA fragments 
(Extended Data Fig. 3). 

Four features of the Neanderthal introgression map suggest that it is 
producing reasonable results. First, when we infer Neanderthal ances- 
try using low-coverage data from Croatian Neanderthals’ we obtain 
correlated inferences (Spearman rank correlation ρ = 0.88 in Europeans;
Supplementary Information section 3). Second, in the African Luhya 
in Webuye, Kenya (LWK), the proportion of the genome inferred to be 
Neanderthal is 0.08%, an order of magnitude smaller than in non- 
African populations (Table 1). Third, the proportion of the genome 
with confidently inferred Neanderthal ancestry has a mean of 1.38% in 
east-Asian and 1.15% in European populations (Table 1), consistent 
with previous reports of more Neanderthal ancestry in east-Asian than
in European populations. Fourth, the standard deviation in Neanderthal
ancestry among individuals from within the same population
is 0.06-0.10%, in line with theoretical expectation (Supplementary
Information section 3), showing that Neanderthal ancestry calculators
that estimate differences on the order of a per cent are largely inferring
statistical noise.

Figure 1 | Maps of Neanderthal ancestry. a, Individual maps; the marginal
probability of Neanderthal ancestry for one European-American, one east-
Asian and one sub-Saharan-African phased genome across chromosome 9.
b, Population maps; estimate of the proportion of Neanderthal ancestry in
European individuals (red) and east-Asian individuals (green), averaged across
all individuals from each population in non-overlapping 100-kb windows on
chromosome 9. The black bar denotes the coordinates of the centromere. The
plot is limited to segments of the chromosome that pass filters (see
Supplementary Information section 8). CEU, residents of Utah, US, with
northern and western European ancestry (from the Centre d’Etude du
Polymorphisme Humain (CEPH) collection); CHB, Han Chinese in Beijing,
China; LWK, African Luhya in Webuye, Kenya.

Table 1 | Genome-wide estimates of Neanderthal ancestry

Region      Population  Number of    Neanderthal ancestry  Neanderthal ancestry on
                        individuals  on autosomes (%)      the X chromosome (%)
Europe      CEU         85           1.17 ± 0.08           0.21 ± 0.17
            FIN         93           1.20 ± 0.07           0.19 ± 0.14
            GBR         89           1.15 ± 0.08           0.20 ± 0.15
            IBS         14           1.07 ± 0.06           0.23 ± 0.18
            TSI         98           1.11 ± 0.07           0.25 ± 0.20
East Asia   CHB         97           1.40 ± 0.08           0.30 ± 0.21
            CHS         100          1.37 ± 0.08           0.27 ± 0.21
            JPT         89           1.38 ± 0.10           0.26 ± 0.21
America     CLM         60           1.14 ± 0.12           0.22 ± 0.16
            MXL         66           1.22 ± 0.09           0.21 ± 0.15
            PUR         55           1.05 ± 0.12           0.20 ± 0.15
Africa      LWK         97           0.08 ± 0.02           0.04 ± 0.07
            ASW         61           0.34 ± 0.22           0.07 ± 0.11

For each computationally phased genome in each population, we estimated the probability of Neanderthal ancestry at each SNP and the fraction of autosomal and X chromosome SNPs that are confidently of
Neanderthal origin in each individual (marginal probability >90%). The table reports the average and standard deviation of this statistic across individuals within each population. ASW, people with African
ancestry in Southwest United States; CEU, Utah residents with northern and western European ancestry (from the Centre d’Etude du Polymorphisme Humain (CEPH) collection); CHB, Han Chinese in Beijing,
China; CHS, Han Chinese in South China; CLM, Colombians in Medellin, Colombia; FIN, Finnish in Finland; GBR, British from England and Scotland, UK; IBS, Iberian populations in Spain; JPT, Japanese in Tokyo,
Japan; LWK, African Luhya in Webuye, Kenya; MXL, people with Mexican ancestry in Los Angeles, California; PUR, Puerto Ricans in Puerto Rico; TSI, Toscani in Italy.

The Neanderthal introgression map reveals locations where Nean- 
derthal ancestry is inferred to be as high as 62% in east-Asian and 64% 
in European populations (Fig. 1b and Extended Data Fig. 2). Several of 
these regions provide evidence of positive selection if we assume a 
model in which the distribution of Neanderthal ancestry has been 
governed by neutral drift; however, this assumption is problematic 
in light of the evidence for widespread negative selection against Nean- 
derthal ancestry reported below (Supplementary Information section 5). 
To further explore whether Neanderthal alleles may have been affected 
by positive selection, we examined the 5% of genes with the highest 
inferred Neanderthal ancestry. We do not detect tissue-specific expres- 
sion patterns; however genes involved in keratin filament formation 
and some other biological pathways are significantly enriched in Nean- 
derthal ancestry in European populations, east-Asian populations, or 
both (Extended Data Table 1 and Supplementary Information section 6). 
Thus, Neanderthal alleles that affect skin and hair may have helped 
modern humans to adapt to non-African environments. We also investigated
the relevance of Neanderthal alleles to present-day human
biology by identifying alleles of Neanderthal origin (Supplementary 
Information section 7), and overlapping this list with alleles that have 
been associated with phenotypes of medical relevance’””’. We identify 
alleles of Neanderthal origin that affect lupus, biliary cirrhosis, Crohn’s 
disease, optic-disk size, smoking behaviour, IL-18 levels and type 2 
diabetes” (Extended Data Table 2). 

The most striking feature of the introgression map is large ‘deserts’ 
of Neanderthal ancestry: on a 10-megabase (Mb) scale on the auto- 
somes, there are 4 windows in European and 14 in east-Asian popula- 
tions with Neanderthal ancestry < 0.1% (Extended Data Fig. 2 and 
Supplementary Information section 8). Two analyses show that these 
deserts are not artefacts of reduced power to detect ancestry. First, these 
regions were detected using a probability threshold for calling a segment 
as Neanderthal of >25%, which results in a much higher sensitivity to 
true segments of Neanderthal ancestry than does a threshold of >90% 
(Supplementary Information section 3). Second, when we estimate Nean- 
derthal ancestry in regions of low recombination rate where Neanderthal 
haplotypes are longer so that we have more power to detect them, we 
see a decreased Neanderthal ancestry proportion, opposite to the expec- 
tation from increased power (p = 0.221, P= 4.4 X 10 * in European 
individuals; p = 0.226, P= 1.9 X 10 “in east-Asian individuals) (Sup- 
plementary Information section 8). As we also observe multi-megabase 
regions of increased Neanderthal ancestry, part of the explanation for 
the ancestry deserts is likely to be small population sizes shortly after
interbreeding (Supplementary Information section 8). However, selection
also seems to have contributed to Neanderthal ancestry deserts, as
we detect a correlation to functionally important regions (below). 

To explore whether selection provides part of the explanation for 
regions of reduced Neanderthal ancestry, we tested for a correlation of 
Neanderthal ancestry to a previously published ‘B statistic’, in which 
low B implies a high density of functionally important elements*’. We 
find that low B is significantly correlated to low Neanderthal ancestry: 
p = 0.32 in European populations (P = 4.9 x 107°”) and p = 0.31 in 
east-Asian populations (P = 3.88 X 10 °°) (Fig. 2 and Supplementary 
Information section 8). This is not an artefact of reduced power, as 
there is expected to be reduced genetic variation in regions of low B 
which should make introgressed Neanderthal haplotypes stand out 
more clearly (Extended Data Table 3 and Supplementary Information 
section 2). We also estimated Neanderthal ancestry in quintiles of B 
statistic using an approach that is not biased by varying mutation rates, 
recombination rates, or genealogical tree depth”, and confirmed that 
the quintile with the highest B has significantly higher Neanderthal 
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Figure 2 | Functionally important regions are deficient in Neanderthal 
ancestry. The median of the proportion of Neanderthal ancestry (estimated as 
the average over the marginal probability of Neanderthal ancestry assigned 
to each individual allele at a SNP) within quintiles of a B statistic that measures 
proximity to functionally important regions (1-low, 5-high). We show 

results on the autosomes and the X chromosome, and in European and 
east-Asian populations. 


20 MARCH 2014 | VOL 507 | NATURE | 355 




Table 2 | Enrichment of tissue-specific genes in regions deficient in Neanderthal ancestry 


Tissue    European      European      European   East-Asian    East-Asian    East-Asian
          whole genome  X chromosome  autosomes  whole genome  X chromosome  autosomes
Adipose   0.93          1             0.81       0.99          1             0.95
Adrenal   0.5           NA            0.5        0.42          NA            0.42
Blood     0.99          0.98          0.89       0.94          0.73          0.94
Brain     1             1             1          1             1             1
Breast    0.98          0.63          0.89       1             0.94          1
Colon     0.64          0.77          0.63       0.94          0.97          0.89
Heart     0.99          0.71          0.99       0.8           0.57          0.81
Kidney    1             0.15          1          1             0.08          1
Liver     0.99          0.99          0.99       1             0.86          1
Lung      0.96          0.64          0.96       0.99          0.87          0.99
Lymph     0.88          0.62          0.9        0.99          0.51          0.99
Ovary     0.84          0.95          0.81       0.62          0.91          0.58
Prostate  1             0.79          1          1             0.73          1
Muscle    0.95          0.7           0.95       0.83          0.1           0.88
Testes    0.0095        0.13          0.016      0.018         0.039         0.055
Thyroid   0.86          0.62          0.88       0.87          0.94          0.86


Tissue-specific genes (defined as those that are significantly more highly expressed in the specified tissue than in any of the 15 other tissues) are compared to all other expressed genes in that tissue. Of the 16 tissues 
tested, only testes-specific genes are significantly enriched in the regions deficient in Neanderthal ancestry, defined as locations in which all sites across all individuals are assigned a marginal probability of Neanderthal 
ancestry of <10% (47% of genes in European individuals and 52% of genes in east-Asian individuals fall into this category). NA, no tissue-specific genes for this tissue on the X chromosome. Units are P values. 


ancestry than the other quintiles (P = 7 X 10 “) (Extended Data Table 4 
and Supplementary Information section 9). 

The largest deserts of Neanderthal ancestry are on the X chromosome, 
where the mean Neanderthal ancestry is about a fifth of the autosomes 
(Table 1). The power of our CRF to detect Neanderthal ancestry is 
higher on the X chromosome than on the autosomes (Extended Data 
Table 5 and Supplementary Information section 2), implying that this 
observation cannot be an artefact of reduced power. At least some of the 
reduction in Neanderthal ancestry that we observe on the X chro- 
mosome must be due to selection, since—just as on the autosomes— 
we observe that Neanderthal ancestry is positively correlated with B 
statistic (9 = 0.276, P = 3.1 X 10 *for European populations; p = 0.176, 
P = 0.02 for east-Asian populations) (Fig. 2 and Supplementary Infor- 
mation section 8). Studies in many species have shown that genes 
responsible for reduced male fertility disproportionally map to the X 
chromosome (refs 10-12). We reasoned that this ‘large X effect’ could
explain why the X chromosome was more resistant to introgression of 
Neanderthal ancestry than the autosomes. 

If male hybrid sterility is contributing to our observations, a predic- 
tion is that the responsible genes will be disproportionally expressed in 
testes’. To test this hypothesis, we analysed gene transcripts from 16 
human tissues” and defined ‘tissue-specific’ genes as those with a signi- 
ficantly higher expression level in that tissue than any other. We found that 
only genes that are specific to testes were enriched in regions of low 
Neanderthal ancestry (when compared with all other genes expressed 
in the same tissue). This effect remained significant after permuting gene 
annotations while preserving the correlation structure between Neanderthal
ancestry and gene expression (P = 0.0095 in European populations;
P = 0.018 in east-Asian populations) (Table 2 and Supplementary
Information section 6). However, hybrid sterility is not the only factor 
responsible for selection against Neanderthal material, as Neanderthal 
ancestry is also depleted in conserved pathways such as RNA proces- 
sing (P< 0.05; Extended Data Table 2 and Supplementary Informa- 
tion section 6). 

We have shown that interbreeding of Neanderthals and modern 
humans introduced alleles onto the modern human genetic background 
that were not tolerated, which probably resulted in part from their 
contributing to male hybrid sterility. The resulting reduction in Nean- 
derthal ancestry was quantitatively large: in the fifth of the genome 
with highest B, Neanderthal ancestry is 1.54 + 0.15 times the genome- 
wide average (Extended Data Table 4 and Supplementary Information 
section 9)**. If we assume that this subset of the genome was unaffected 
by selection, this implies that the proportion of Neanderthal ancestry 
shortly after introgression must have been >3% rather than the approximately
2% seen today. The large effect of negative selection on present-day
levels of Neanderthal ancestry may explain why the proportion of
Neanderthal ancestry is significantly higher in present-day east-Asian
than in European populations (Table 1)”"’; there is evidence that east- 
Asian population sizes have been smaller than European populations 
for some of the time since their separation”®, which could have resulted 
in less efficient selection to remove Neanderthal-derived deleterious 
alleles. The evidence for male hybrid sterility is particularly remark- 
able when compared with mixed populations of present-day humans 
in which no convincing signals of selection against alleles inherited 
from one of the mixing populations have been found despite high power 
to detect such effects”’. Thus, although the time of separation between 
Neanderthals and modern humans was only about five times larger 
than that between present-day European and west-African populations’, 
the biological incompatibility was far greater. A potential explanation 
is the ‘snowball effect’, whereby hybrid sterility genes are expected to 
accumulate in proportion to the square of the substitutions between 
two taxa because two interacting loci need to change to produce an 
incompatibility (‘Dobzhansky-Muller incompatibilities’). An important
direction for future work is to explore whether similar phenomena have 
affected other interbreeding events between diverged hominin groups. 


METHODS SUMMARY 


We infer the distribution of Neanderthal ancestry at each allele in a phased test 
genome using a conditional random field (CRF)'*. The data we use consist of a 
panel of African reference genomes, the high-coverage Neanderthal genome, and a
genetic map. For a set of CRF parameters, we can compute the probability of Nean- 
derthal ancestry at each allele using the forward-backward algorithm (Supplemen- 
tary Information section 1). Parameter estimation in the CRF needs training data 
(haplotypes labelled with true Neanderthal ancestries). We estimated the CRF 
parameters from data simulated under a demography relating European indivi- 
duals, west-African individuals and Neanderthals (Supplementary Information 
section 2). We assessed the robustness of the CRF by measuring its false discovery 
rate in simulated demographic models that are different from the ones used in the 
simulation to train the CRF (Supplementary Information section 2). We also 
assessed the robustness to errors in phasing and in the genetic map (Supplemen- 
tary Information section 2). For some analyses we estimate Neanderthal ancestry 
proportion based on all alleles where the marginal probability of Neanderthal 
ancestry as inferred by the CRF is greater than a specified threshold. For other 
analyses, we compute it as the average marginal probability of Neanderthal ancestry. 
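
The two summaries of Neanderthal ancestry described above can be sketched as follows. This is a minimal illustration rather than the authors' code: the function names and the example probabilities are hypothetical, and the default threshold of 0.9 is taken from the >90% cut-off quoted for Table 1.

```python
def ancestry_by_threshold(marginals, threshold=0.9):
    """Fraction of alleles whose marginal probability of Neanderthal
    ancestry is strictly greater than the threshold."""
    if not marginals:
        raise ValueError("need at least one marginal probability")
    confident = sum(1 for p in marginals if p > threshold)
    return confident / len(marginals)


def ancestry_by_mean(marginals):
    """Average marginal probability of Neanderthal ancestry across alleles."""
    if not marginals:
        raise ValueError("need at least one marginal probability")
    return sum(marginals) / len(marginals)


# Hypothetical per-allele marginal probabilities for one haplotype.
example = [0.95, 0.50, 0.10, 0.99]
thresholded = ancestry_by_threshold(example)        # counts 2 of 4 alleles
lenient = ancestry_by_threshold(example, 0.25)      # lower threshold, more sensitive
averaged = ancestry_by_mean(example)
```

Lowering the threshold trades precision for sensitivity, which is why a lenient threshold is the natural choice when the aim is to show that a region truly lacks Neanderthal ancestry.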


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper.

METHODS


Conditional random field for inferring Neanderthal local ancestry. For a hap- 
loid genome in a test population that carries Neanderthal ancestry (for example, 
Europeans), given the allelic states of a sequence of SNPs along this haplotype, we 
wish to infer the ancestral state of the allele at each SNP; specifically, whether it has 
entered modern humans through Neanderthal gene flow. In addition to the test 
haplotype, the data analysed consist of a panel of haplotypes from the sub-Saharan- 
African Yoruba (YRI) who we assume harbour no Neanderthal ancestry’. To 
determine the allelic state of the Neanderthals, we used a high-coverage Nean- 
derthal genome’. We determined the ancestral and derived allele at each SNP using 
a six-primate consensus sequence’’. To estimate the genetic distance between 
adjacent SNPs, we used the Oxford combined linkage-disequilibrium map”. We 
specified the distribution of the unobserved Neanderthal ancestry states at each 
SNP given the observed genetic data as a conditional random field (CRF). We 
specified CRF feature functions that relate the observed data and the unobserved 
ancestral state at each SNP (‘emission functions’) as well as feature functions that 
relate the unobserved ancestral states at adjacent SNPs (‘transition functions’). 
Thus, the model is a linear-chain CRF. The feature functions and their associated 
parameters fully specify the distribution of the unobserved ancestral states given 
the observed data. Given the parameters and the observed data, we were able to 
infer the marginal probability of Neanderthal ancestry at each SNP of the haploid 
genome. We computed the marginal probabilities efficiently using the forward- 
backward algorithm'*”°. Supplementary Information section 1 presents the math- 
ematical details. 
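
As an illustration of how marginal probabilities are obtained in a linear-chain model, the following is a minimal forward-backward sketch in log space. It is not the implementation used in the study: the function names, the two-state layout (state 1 standing for Neanderthal ancestry) and the position-independent transition scores are simplifying assumptions, and the feature functions are assumed to have already been collapsed into per-SNP emission scores.

```python
from math import exp, log


def logsumexp(xs):
    """Numerically stable log(sum(exp(x) for x in xs))."""
    m = max(xs)
    return m + log(sum(exp(x - m) for x in xs))


def forward_backward(emission, transition):
    """Marginal state probabilities for a linear-chain model.

    emission[t][s]   -- log-potential of state s at position t
    transition[r][s] -- log-potential of moving from state r to state s
                        (assumed identical at every position for simplicity)
    Returns marginals[t][s], which sum to 1 over s at each position t.
    """
    n, k = len(emission), len(emission[0])
    fwd = [[0.0] * k for _ in range(n)]
    bwd = [[0.0] * k for _ in range(n)]  # last row stays 0.0 (log of 1)
    fwd[0] = list(emission[0])
    for t in range(1, n):
        for s in range(k):
            fwd[t][s] = emission[t][s] + logsumexp(
                [fwd[t - 1][r] + transition[r][s] for r in range(k)])
    for t in range(n - 2, -1, -1):
        for r in range(k):
            bwd[t][r] = logsumexp(
                [transition[r][s] + emission[t + 1][s] + bwd[t + 1][s]
                 for s in range(k)])
    log_z = logsumexp(fwd[n - 1])  # log partition function
    return [[exp(fwd[t][s] + bwd[t][s] - log_z) for s in range(k)]
            for t in range(n)]


# Hypothetical scores for three SNPs; the 'sticky' transitions favour
# staying in the same ancestry state between adjacent SNPs.
emission_scores = [[0.0, 2.0], [0.5, 0.0], [0.0, 1.0]]
transition_scores = [[1.0, 0.0], [0.0, 1.0]]
marginals = forward_backward(emission_scores, transition_scores)
```

The cost is linear in the number of SNPs, which is what makes per-allele marginals tractable along a whole chromosome.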

Feature functions. The emission functions couple the unobserved ancestral state 
at a SNP to the observed features. We used two classes of emission functions. 

The first class of emission functions captures information from the joint pat- 
terns observed at a single SNP across European and African individuals, and 
Neanderthals. These features are indicator functions that assume the value ‘1’
when a specific pattern is observed at a SNP and ‘0’ otherwise. We used feature 
functions that pick out two classes of allelic patterns. One of these features is 1 if at 
a given SNP, the test haplotype carries the derived allele, all the YRI haplotypes 
carry the ancestral allele, and either of the two Neanderthal alleles is derived. SNPs 
with this joint configuration have an increased likelihood of Neanderthal ancestry. 
In the CRF, an increased likelihood associated with this feature is reflected in the 
fact that the parameter is positive with a magnitude determined by the informa- 
tiveness of the feature. The second feature is 1 if at a given SNP the test haplotype 
carries a derived allele that is polymorphic in the panel of African individuals but 
absent in the Neanderthal. SNPs with this joint configuration have a decreased 
likelihood of Neanderthal ancestry. 
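
The two single-SNP indicator features can be written down directly. The sketch below is illustrative only: the function names and the encoding (0 for the ancestral allele, 1 for the derived allele) are assumptions, and 'polymorphic' is interpreted here as both alleles being present in the African panel.

```python
def neanderthal_specific_feature(test_allele, yri_alleles, neanderthal_alleles):
    """1 if the test haplotype is derived, every YRI haplotype is ancestral,
    and at least one of the two Neanderthal alleles is derived; else 0."""
    return int(
        test_allele == 1
        and all(a == 0 for a in yri_alleles)
        and any(a == 1 for a in neanderthal_alleles)
    )


def african_specific_feature(test_allele, yri_alleles, neanderthal_alleles):
    """1 if the test haplotype carries a derived allele that is polymorphic
    in the African panel but absent from the Neanderthal; else 0."""
    polymorphic = (any(a == 1 for a in yri_alleles)
                   and any(a == 0 for a in yri_alleles))
    return int(
        test_allele == 1
        and polymorphic
        and all(a == 0 for a in neanderthal_alleles)
    )


# Hypothetical SNP: derived on the test haplotype, ancestral in all YRI
# haplotypes, heterozygous in the Neanderthal.
supports_neanderthal = neanderthal_specific_feature(1, [0, 0, 0, 0], (0, 1))
supports_modern = african_specific_feature(1, [0, 0, 0, 0], (0, 1))
```

In the model, each feature is multiplied by a learned parameter, so the first feature raises and the second lowers the score for Neanderthal ancestry at that SNP.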

The second class of emission functions uses multiple SNPs to capture the signal 
of Neanderthal ancestry. Specifically, we compared the divergence of the test 
haplotype to the Neanderthal sequence to the minimum divergence of the test 
haplotype to all African haplotypes over non-overlapping 100-kilobase (kb) win- 
dows (the size scale we expect for Neanderthal haplotypes today based on the time 
of Neanderthal gene flow into modern humans”). In a region of the genome in 
which the test haplotype carries Neanderthal ancestry, we expect the test haplotype 
to be closer to the Neanderthal sequence than to most modern human sequences 
(albeit with a large variance), and we expect the pattern to be reversed outside these 
regions. We chose the derived allele at heterozygous sites so that this distance was 
effectively the minimum distance of the potentially introgressed test haplotype to 
one of the two Neanderthal haplotypes. 
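
A minimal sketch of the divergence comparison for a single window is given below. It is an illustration under stated assumptions, not the published implementation: alleles are encoded as 0 (ancestral) and 1 (derived), divergence is counted as the number of mismatching SNPs in the window, and all names and example data are hypothetical.

```python
def neanderthal_sequence(genotypes):
    """Collapse unphased Neanderthal genotypes (pairs of alleles) into one
    sequence, taking the derived allele wherever either allele is derived."""
    return [1 if (a == 1 or b == 1) else 0 for a, b in genotypes]


def divergence(hap_a, hap_b):
    """Number of SNPs at which two haplotypes carry different alleles."""
    if len(hap_a) != len(hap_b):
        raise ValueError("haplotypes must cover the same SNPs")
    return sum(1 for x, y in zip(hap_a, hap_b) if x != y)


def window_divergences(test_hap, neanderthal_genotypes, african_haps):
    """Return (divergence to Neanderthal, minimum divergence to any African
    haplotype) for the SNPs of one window."""
    d_neanderthal = divergence(test_hap, neanderthal_sequence(neanderthal_genotypes))
    d_african = min(divergence(test_hap, h) for h in african_haps)
    return d_neanderthal, d_african


# Hypothetical window of five SNPs.
test_hap = [1, 1, 0, 1, 0]
neanderthal = [(1, 1), (0, 1), (0, 0), (1, 1), (0, 0)]
africans = [[0, 0, 0, 0, 0], [0, 0, 1, 0, 0]]
d_nea, d_afr = window_divergences(test_hap, neanderthal, africans)
```

A window in which the first value is much smaller than the second is the pattern expected where the test haplotype carries Neanderthal ancestry.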

The transition feature function modulates the correlation of the ancestral states 
at adjacent SNPs. We defined this feature function as an approximation, at small 
genetic distance, to the log of the transition probabilities of a standard Markov 
process of admixture between two populations. This approximation makes para- 
meter estimation in the CRF efficient. 

Parameter estimation. To estimate the parameters of the CRF, we needed hap- 
lotypes labelled with Neanderthal ancestries; that is, training data. As we do not in 
fact know the true Neanderthal state in any individual, we estimated the CRF 
parameters on data simulated under a demographic model. We estimated para- 
meters by maximizing the L2-regularized conditional log likelihood* using a 
limited-memory version of LBFGS (limited memory Broyden-Fletcher-Goldfarb- 
Shanno)*'. We fixed the value of the parameter associated with the L2 penalty at 10 
although a broad range of values seem to work in practice. We assumed a simple 
demographic model relating African and European individuals and Neanderthals 
with Neanderthal-modern human admixture occurring 1,900 generations ago" 
and a fraction of Neanderthal ancestry of 3%’. The model parameters were broadly 
constrained by the observed allele frequency differentiation between the west- 
African YRI and European-American CEU populations, and by the observed 
excess sharing of alleles between European and African populations relative to 
Neanderthals. The simulations incorporated hotspots of recombination’ as well as
the reduced power to detect low-frequency alleles from low-coverage sequencing
data’’.

Validation of the conditional random field. We assessed the accuracy of the CRF 
to predict Neanderthal ancestry using simulated data. Given the marginal prob- 
abilities estimated by the CRF, we estimated the precision (fraction of predictions 
that are truly Neanderthal) and the recall (fraction of true Neanderthal alleles that 
are predicted) as we varied the threshold on the marginal probability for an allele 
to be declared Neanderthal. We also evaluated the accuracy when the haplotype 
phase needed to be inferred and when the genetic map had errors. As the CRF 
parameter estimation assumes a specific demographic model, we were concerned 
about the possibility that the inferences might be sensitive to the model assumed. 
We therefore varied each demographic parameter in turn and applied the CRF to 
data simulated under these perturbed models, fixing the parameters of the CRF 
to the estimates obtained under the original model. For each of these perturbed 
models, we evaluated the false discovery rate (defined as one minus the precision) 
when we restricted to sites at which the CRF assigns a marginal probability of at 
least 0.90. Supplementary Information section 2 presents the details. 
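
A minimal sketch of the precision and recall calculation at a given threshold is shown below, assuming that the simulated truth is available as one boolean per allele. The function name and the toy inputs are illustrative only.

```python
def precision_recall(marginals, truth, threshold):
    """Precision and recall of calls made at marginal probability >= threshold.

    marginals: CRF marginal probabilities, one per allele.
    truth: booleans, True where the simulated allele is truly Neanderthal.
    """
    predicted = [p >= threshold for p in marginals]
    true_positives = sum(1 for pred, t in zip(predicted, truth) if pred and t)
    n_predicted = sum(predicted)
    n_true = sum(truth)
    precision = true_positives / n_predicted if n_predicted else float("nan")
    recall = true_positives / n_true if n_true else float("nan")
    return precision, recall


precision, recall = precision_recall(
    [0.95, 0.92, 0.40, 0.91, 0.10], [True, True, True, False, False], 0.90
)
false_discovery_rate = 1.0 - precision  # defined as one minus the precision
```

Sweeping `threshold` over a grid of values traces out the precision-recall curve.
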
Preparation of the 1000 Genomes data. We applied the CRF to the computa- 
tionally phased haplotypes in each of the 13 populations in the 1000 Genomes 
Project’ (1KG), excluding the west-African Yoruba (YRI). The CRF requires ref- 
erence genomes from African individuals and Neanderthals. For the African 
population, we used 176 haplotypes from 88 YRI individuals. For the Neanderthal 
genome, for most analyses we used the genotypes called from the recently gener- 
ated high-coverage Neanderthal sequence’. We restricted our analysis to sites 
passing the filters described in ref. 2 and for which the genotype quality score was
≥30. These filters discarded sites that are identified as repeats by the Tandem
Repeat Finder annotation for hg19 from the University of California Santa Cruz 
(UCSC) genome browser (available at http://hgdownload.soe.ucsc.edu/goldenPath/ 
hg19/database/simpleRepeat.txt.gz), that have Phred-scaled mapping quality scores 
of <30, or that map to regions in which the alignment is ambiguous or which fall 
within the upper or lower 2.5th percentile of the sample-specific coverage distri- 
bution (applied within the regions of unique mappability binned according to the 
GC content of the reference genome). For the mappability filter, we used the liberal 
filter that requires that at least 50% of all 35-mers that overlap a position do not 
map to any other position in the genome allowing up to one mismatch. We further 
restricted our analysis to sites that are biallelic across the Neanderthal and the 1000 
Genomes Project samples. After filtering, we were able to analyse 26,493,206 SNPs 
on the autosomes and 817,447 SNPs on the X chromosome. For each haplotype 
analysed, we also restricted to the set of polymorphic sites in the population 
containing the haplotype. We obtained genetic distances from the Oxford com- 
bined linkage-disequilibrium map lifted over to hg19 coordinates”. For the X 
chromosome, we obtained an appropriate sex-averaged map by scaling the X 
chromosome linkage-disequilibrium-based map by two-thirds. 

Statistics for measuring Neanderthal ancestry. We computed several statistics 
to summarize the Neanderthal ancestry inferred by the CRF. We estimated the 
proportion of an individual diploid genome that is confidently inferred to be 
Neanderthal as the fraction of sites for which the marginal probability is ≥90%.
To assess variation in the proportion of Neanderthal ancestry along the genome, 
we computed the fraction of alleles across individuals with marginal probability 
greater than a specified threshold. This statistic is likely to be affected by variation 
in power along the genome. Hence, we also consider an estimate of the ancestry 
proportion obtained by averaging the marginal probability across individuals. 
Depending on the analyses, these statistics are estimated at a single SNP or in 
non-overlapping windows of a specified size. 
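
The two summaries can be contrasted with a short sketch. The function names are hypothetical, and the inputs are assumed to be the marginal probabilities for the alleles at one SNP (or pooled over one window).

```python
def confident_fraction(marginals, threshold=0.90):
    """Fraction of alleles whose marginal probability is at least the threshold."""
    return sum(p >= threshold for p in marginals) / len(marginals)


def mean_marginal(marginals):
    """Average marginal probability, which does not depend on a threshold."""
    return sum(marginals) / len(marginals)
```

Because the thresholded fraction counts only confident calls, it falls where power is low, whereas the averaged marginal still receives a contribution from alleles with intermediate probabilities.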

To assess whether the predictions made by the CRF are sensible, we inferred 
Neanderthal ancestry using the low-coverage genome from the Vindija Neanderthals’. 
For this analysis, we restricted to sites at which there is at least one read with 
mapping quality score between 60 and 90 and base quality of at least 40. As a 
second validation analysis, we applied the CRF to the sub-Saharan-African Luhya 
(LWK), using the parameters estimated with non-African individuals. We
empirically assessed the accuracy of the CRF on the 1000 Genomes Project data by
assuming that LWK has no Neanderthal ancestry, that the false discovery rate in 
each non-African population is equal to the false discovery rate in LWK, and using 
the genome-wide proportion of Neanderthal ancestry estimated in ref. 2
(Supplementary Information section 3). We computed the theoretical standard
deviation in the proportion of Neanderthal ancestry” assuming a pulse model of
admixture with 2% Neanderthal ancestry followed by 2,000 generations of random 
mating, and 2.03 gigabases as the number of bases of the high-coverage Neanderthal 
genome that pass filters. 

Tiling path of Neanderthal haplotypes. We identified Neanderthal haplotypes as 
runs of consecutive alleles along a test haplotype assigned a marginal probability of 
>90%. We discarded haplotypes shorter than 0.02 centimorgans (cM). At each SNP
that is covered by at least one such haplotype, we estimated the allelic state as the 
consensus allele across the spanning haplotypes. See Supplementary Information
section 4. 
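
The haplotype-calling step can be sketched as follows. This is an illustrative reading of the description above, not the authors' code: the function names are hypothetical, the genetic length of a run is taken as the map distance between its first and last SNP, and the consensus is taken to be the most common allele.

```python
from collections import Counter


def confident_haplotypes(marginals, genetic_pos_cm, threshold=0.90, min_len_cm=0.02):
    """Return inclusive (start, end) SNP index pairs for confident runs.

    A run is a maximal stretch of consecutive SNPs with marginal > threshold;
    runs spanning less than min_len_cm on the genetic map are dropped.
    """
    runs, start = [], None
    for i, p in enumerate(marginals):
        if p > threshold:
            if start is None:
                start = i
        elif start is not None:
            runs.append((start, i - 1))
            start = None
    if start is not None:
        runs.append((start, len(marginals) - 1))
    return [(s, e) for s, e in runs if genetic_pos_cm[e] - genetic_pos_cm[s] >= min_len_cm]


def consensus_allele(alleles):
    """Most common allele among the haplotypes spanning one SNP."""
    return Counter(alleles).most_common(1)[0][0]
```

Applying `consensus_allele` at every SNP covered by at least one retained run gives one allelic state per position along the tiling path.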

Functional analysis of introgressed alleles. We defined two subsets of consensus 
coding sequence (CCDS) genes**. We define a gene with ‘low Neanderthal
ancestry’ as one in which all alleles across all individuals have a marginal
probability ≤10%. We also require that the genes included in this analysis contain at
least 100 SNPs within a 100-kb window centred at their midpoint (this excluded
genes at which the power to infer Neanderthal ancestry is reduced owing to a reduced
number of informative sites). We define a gene with ‘high Neanderthal ancestry’ as
one that is in the top 5% of CCDS genes ranked by the average marginal probability 
across individual haplotypes. See Supplementary Information section 6 for details. 
Functional enrichment analysis. We tested for enrichment of Gene Ontology 
(GO)* categories in genes with low or high Neanderthal ancestry, using the hyper- 
geometric test implemented in the FUNC package**. We report multiple-testing 
corrected P values estimated from 1,000 permutations for the GO enrichment 
analysis (family-wise error rate, FWER). Given the observed correlation between 
Neanderthal ancestry and B statistic’’, a concern is that the functional categories 
may not be randomly distributed with respect to the B statistic. To control for this, we
assigned a B statistic to each gene (estimated as an average of the B statistic over the 
length of the gene) as well as a uniform random number. This resulted in 17,249 
autosomal genes. Genes were binned into 20 equal-sized bins based on the
gene-specific B statistic. Within each bin, genes were sorted by Neanderthal ancestry and
then by the random number. Genes ranked within the top 5% within each bin were 
used for the analysis. See Supplementary Information section 6 for details. 
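
A sketch of this stratified selection is given below for the high-ancestry case. The tuple layout, the function name, the fixed random seed and the use of an integer percentage are assumptions made for the example; ties in ancestry are broken by the random number, as described above.

```python
import random


def top_genes_within_b_bins(genes, n_bins=20, top_percent=5, seed=0):
    """Select the top-ranked genes by Neanderthal ancestry within each B-statistic bin.

    genes: list of (name, b_statistic, neanderthal_ancestry) tuples.
    """
    rng = random.Random(seed)
    # Attach a uniform random number to each gene for tie-breaking.
    tagged = [(name, b, ancestry, rng.random()) for name, b, ancestry in genes]
    by_b = sorted(tagged, key=lambda g: g[1])
    selected = []
    for k in range(n_bins):
        lo = k * len(by_b) // n_bins
        hi = (k + 1) * len(by_b) // n_bins
        # Sort by ancestry (highest first), then by the random number.
        bin_genes = sorted(by_b[lo:hi], key=lambda g: (-g[2], g[3]))
        n_top = len(bin_genes) * top_percent // 100
        selected.extend(g[0] for g in bin_genes[:n_top])
    return selected
```

Selecting within bins means the chosen genes have the same B-statistic distribution as the full gene set, so an enrichment cannot simply reflect a difference in B between functional categories.
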
Identification of alleles born in Neanderthals and cross-correlation with 
association-study data. To infer whether an allele segregating in a present-day 
human population was introduced by Neanderthal gene flow, we defined a probable 
Neanderthal allele as one with marginal probability of ≥90% and a non-Neanderthal
allele as having a marginal probability of ≤10%. A SNP at which all of the confident
non-Neanderthal alleles as well as all alleles in YRI are ancestral and all of the 
confident Neanderthal alleles are derived is inferred to be of Neanderthal origin. 
This procedure allows for some false negatives in the prediction of the CRF. This 
procedure yields 97,365 Neanderthal-derived SNPs when applied to the predic- 
tions in European and east-Asian individuals. We downloaded the variants listed 
in the NHGRI (National Human Genome Research Institute) GWAS catalogue’, 
retaining entries for which the reported association is a SNP with an assigned rs 
number, and for which the nominal P value is less than 5 × 10⁻⁸. This resulted in
5,022 associations, which we then intersected with the Neanderthal-derived list. 
See Supplementary Information section 7 for details. 
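
The rule for calling a SNP Neanderthal-derived can be sketched as below. The 0/1 allele encoding and the function name are assumptions for this example, and the requirement that at least one confident Neanderthal allele be present is an added assumption rather than something stated in the text.

```python
def is_neanderthal_derived(alleles, marginals, yri_alleles, hi=0.90, lo=0.10):
    """Decide whether a SNP looks like it arose on the Neanderthal lineage.

    alleles: 0 (ancestral) or 1 (derived) for each non-African haplotype.
    marginals: CRF marginal probability of Neanderthal ancestry per haplotype.
    yri_alleles: 0/1 alleles for the YRI haplotypes.
    """
    confident_neanderthal = [a for a, p in zip(alleles, marginals) if p >= hi]
    confident_modern = [a for a, p in zip(alleles, marginals) if p <= lo]
    if not confident_neanderthal:  # assumption: need at least one confident call
        return False
    return (
        all(a == 1 for a in confident_neanderthal)
        and all(a == 0 for a in confident_modern)
        and all(a == 0 for a in yri_alleles)
    )
```

Haplotypes with intermediate marginal probabilities are ignored, which is how the procedure tolerates some false negatives in the CRF predictions.
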

Identification of genomic regions deficient in Neanderthal ancestry. We mea- 
sured the fraction of alleles across individuals and SNPs that are assigned a marginal 
probability greater than a chosen threshold measured within non-overlapping 10- 
megabase (Mb) windows. We chose a threshold of 25% as this threshold was found 
to lead to high recall in our empirical assessment (Supplementary Information 
section 3). We reported windows for which this statistic is <0.1%. 

To understand the causes for variation in Neanderthal ancestry, we tested for 
correlation to a B statistic. Each SNP was annotated with the B statistic lifted over 
to hg19 coordinates. We assessed correlation between B and estimates of Neanderthal 
ancestry proportion at a nucleotide level as well as at different size scales, and 
separately on the autosomes and the X-chromosome (Supplementary Information 
section 8). We also used wavelet decomposition”* to analyse the correlation of the 
inferred Neanderthal ancestry between European and east-Asian individuals at 
multiple size scales (Supplementary Information section 10). Figure 2 reports the 
relation between the mean marginal probability of Neanderthal ancestry across 
individuals and quintiles of B statistic at each SNP. To assess significance, we 
estimated Spearman’s correlation ρ and standard errors on ρ using a block
jackknife’” with 10-Mb blocks (Supplementary Information section 8).
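
For readers unfamiliar with the block jackknife, a minimal delete-one-block version is sketched below. It assumes equally weighted blocks and accepts any statistic as a callable; a weighted variant may be more appropriate when blocks contain very different numbers of SNPs. The function name is hypothetical.

```python
import math


def block_jackknife_se(blocks, statistic):
    """Standard error of a statistic from a delete-one-block jackknife.

    blocks: list of lists, each holding the observations from one genomic block.
    statistic: function mapping a flat list of observations to a number.
    """
    g = len(blocks)
    leave_one_out = []
    for i in range(g):
        pooled = [x for j, block in enumerate(blocks) if j != i for x in block]
        leave_one_out.append(statistic(pooled))
    mean_loo = sum(leave_one_out) / g
    variance = (g - 1) / g * sum((t - mean_loo) ** 2 for t in leave_one_out)
    return math.sqrt(variance)


def mean(values):
    return sum(values) / len(values)


se = block_jackknife_se([[1.0], [2.0], [3.0], [4.0]], mean)
```

Dropping whole 10-Mb blocks rather than single SNPs keeps correlated neighbouring sites together, so linkage between nearby SNPs does not cause the standard error to be underestimated.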

To understand the contribution of demography to variation in Neanderthal 
ancestry along the genome, we measured the coefficient of variation, at a 10-Mb 
scale, of the proportion of ancestry estimated as defined above. We then applied 
the CRF to data simulated under diverse demographic models and compared the 
coefficient of variation to the observed value (Supplementary Information section 8). 
Assessment of the power to infer Neanderthal ancestry. To assess the power of 
the CRF to infer Neanderthal ancestry, we simulated data under diverse demo- 
graphic models”. In one simulation series, we varied effective population size to 
approximate the effect of background selection and measured recall at a precision 
of 90%. In a second series, we assessed power on the X-chromosome versus the
autosomes by matching the effective population size, recombination rate and 
mutation rate to estimated values for the X chromosome. See Supplementary 
Information section 2. 

Unbiased estimate of the proportion of Neanderthal ancestry. To estimate the 
proportion of Neanderthal ancestry in an unbiased way, we divided the genome 
into quintiles of B, and estimated the proportion of Neanderthal ancestry using 
a statistic first published in ref. 22. This statistic measures how much closer 
Denisova is to a non-African individual than to an African individual, divided 
by the same quantity replacing the non-African individual with Neanderthal. We 
report the estimated proportion of Neanderthal ancestry in each quintile divided 
by the genome-wide mean and obtain standard errors using a block jackknife with 
100 blocks. 

We analysed data from 27 deeply sequenced genomes: 25 present-day humans
and the high-coverage Neanderthal and Denisova'’ genomes. For each, we
required that sites passed the more stringent set of the two filters described in
ref. 2, had a genotype quality of ≥45, and had an ancestral allele that could be
determined based on comparison to chimpanzee and at least one of gorilla or 
orangutan. We computed a Z-score for the difference in the ancestry across the 
bin of highest B statistic versus the rest and used a Bonferroni correction for ten 
hypotheses (5 hypotheses based on which set of bins we merge and a two-sided test 
in each). In our main analysis, we analysed both transitions and transversions and 
pooled genomes for all non-African individuals. To establish robustness,
we also analysed subsets of the data, repeating the analysis restricting to
transversion SNPs, and then further restricting to transversion SNPs only in European and
only in eastern non-African individuals. See Supplementary Information section 9
for details. 
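
As a worked check, the corrected P values reported in Extended Data Table 4 are consistent with multiplying the one-sided normal tail probability of each Z-score by ten (equivalently, a two-sided test across five hypotheses). The sketch below encodes that reading; it is an inference from the reported numbers, not a statement of the authors' code, and the function name is hypothetical.

```python
import math


def bonferroni_p_from_z(z, n_hypotheses=10):
    """Bonferroni-corrected P value from a Z-score using the one-sided normal tail."""
    one_sided = 0.5 * math.erfc(z / math.sqrt(2.0))
    return min(1.0, n_hypotheses * one_sided)


# Z-scores taken from Extended Data Table 4.
corrected = {z: bonferroni_p_from_z(z) for z in (3.82, 3.02, 3.12, 2.58)}
```

Under this reading, Z = 3.82 gives a corrected P value close to the reported 0.00066, and Z = 2.58 gives a value close to the reported 0.049.
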
Tissue-specific expression. We defined tissue-specific expression levels using the 
Illumina BodyMap 2.0 RNA-seq data**, which contains expression data from 16 
human tissues. We identified genes that are expressed in a tissue-specific manner 
using the DESeq package” and used a P-value cutoff of 0.05. We tested enrichment 
of tissue-specific genes in regions of high or low Neanderthal ancestry. A concern 
when testing for enrichment is that clustering of similarly expressed genes coupled 
with the large size of regions of low Neanderthal ancestry might lead to spurious 
signals of enrichment. Hence, we devised a permutation test that randomly rotates 
the annotations of genes (treating each chromosome as a circle) while maintaining 
the correlation within genes and within Neanderthal ancestry as well as between 
Neanderthal ancestry and genes. We tested enrichment on the whole genome, on 
the autosomes alone, and on the X chromosome alone. We generated 1,000 ran- 
dom rotations for each test except for the X chromosome for which we generated 
all possible rotations. We computed the fraction of permutations for which the 
P value of Fisher’s exact test is at least as low as the observed P value (see
Supplementary Information section 6).
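
The idea behind the rotation test can be sketched as follows. To keep the example short, it makes two simplifying assumptions that differ from the analysis described above: the test statistic is the raw overlap count rather than the P value of Fisher's exact test, and a single circular chromosome is represented as two equal-length 0/1 lists. The function name and seed are hypothetical.

```python
import random


def rotation_p_value(annotation, in_region, n_rotations=1000, seed=0):
    """Permutation P value from circular rotations of a gene annotation.

    annotation: 0/1 per gene (for example, tissue-specific or not), in chromosomal order.
    in_region: 0/1 per gene (for example, inside a low-ancestry region or not).
    """
    n = len(annotation)

    def overlap(shift):
        # Rotate the annotation by `shift` positions around the circle.
        return sum(annotation[(i + shift) % n] * in_region[i] for i in range(n))

    observed = overlap(0)
    rng = random.Random(seed)
    shifts = [rng.randrange(1, n) for _ in range(n_rotations)]
    hits = sum(overlap(s) >= observed for s in shifts)
    return hits / n_rotations
```

Because a rotation shifts the whole annotation track at once, clusters of similarly annotated genes stay clustered in every permuted data set, which is what protects against spurious enrichment from large low-ancestry regions.
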
Extended Data Figure 1 | Three features used in the Conditional Random
Field for predicting Neanderthal ancestry. Top (feature 1), patterns of
variation at a single SNP. Sites at which a panel of sub-Saharan-African
individuals carry the ancestral allele and in which the sequenced Neanderthal
and the test haplotype carry the derived allele are likely to be derived from
[…] sequenced Neanderthal is low, whereas the divergence to a panel of sub-
Saharan-African individuals is high, are likely to be introgressed. Bottom 
(feature 3), we searched for segments that have a length consistent with what is 
expected from Neanderthal-to-modern-human gene flow approximately 2,000 
generations ago, corresponding to a size of about 0.05 cM = (100 cM per 


Extended Data Figure 2 | Map of Neanderthal ancestry in 1000 Genomes
European and east-Asian populations. For each chromosome, we plot the
fraction of alleles confidently inferred to be of Neanderthal origin (probability
>90%) in non-overlapping 1-Mb windows in Europeans (red) and in east
Asians (green). Black bars denote the coordinates of the centromeres. We plot
traces in non-overlapping 10-Mb windows that pass filters. We label
10-Mb-scale windows that are deficient in Neanderthal ancestry (e1–e9 (e, European),
a1–a17 (a, Asian)) (see Supplementary Information section 8 for details).
Extended Data Figure 3 | Tiling path from confidently inferred
Neanderthal haplotypes. a, Example tiling path at the BNC2 locus on
chromosome 9 in European individuals. Red, confidently inferred Neanderthal
haplotypes in a subset of these individuals; blue, resulting tiling path. We
identified Neanderthal haplotypes by scanning for runs of consecutive SNPs
along a haplotype with a marginal probability >90% and requiring the
haplotypes to be at least 0.02 cM long. b, Distribution of contig lengths obtained
by constructing a tiling path across confidently inferred Neanderthal
haplotypes. On merging Neanderthal haplotypes in each of the 1000 Genomes
European and east-Asian populations, we reconstructed 4,437 Neanderthal
contigs with median length 129 kb.
Extended Data Table 1 | Gene categories enriched or depleted in Neanderthal ancestry

Biological pathway (GO categorization)                          Neanderthal ancestry   Europe FWER   East Asian FWER
nucleic acid binding (molecular_function, GO:0003676)           Depleted               0.018         0.032
RNA processing (biological_process, GO:0006396)                 Depleted               0.004         0.049
ribonucleoprotein complex (cellular_component, GO:0030529)      Depleted               <0.001        0.027
organelle part (cellular_component, GO:0044422)                 Depleted               <0.001        0.037
intracellular organelle part (cellular_component, GO:0044446)   Depleted               <0.001        0.025
mRNA metabolic process (biological_process, GO:0016071)         Depleted               <0.001        0.014
nuclear lumen (cellular_component, GO:0031981)                  Depleted               0.039         0.017
nuclear part (cellular_component, GO:0044428)                   Depleted               0.005         0.022
keratin filament (cellular_component, GO:0045095)               Enriched               <0.001        <0.001

Enrichment of Gene Ontology categories in genes with depleted or elevated Neanderthal ancestry was assessed using the hypergeometric test implemented in the FUNC package. We report family-wise error rate
(FWER) P values associated with each GO category (P values corrected for the testing of multiple categories).
Extended Data Table 2 | Neanderthal-derived alleles that have been associated with phenotypes in genome-wide association studies

                                                 Derived allele frequency (%)
rs id         Coordinates      Derived allele   Europeans   East Asians   Phenotype
rs12531711    7:128,617,466    G                10.03       0.17          Systemic lupus erythematosus, Primary biliary cirrhosis
rs3025343     9:136,478,355    A                8.44        0.00          Smoking behavior
rs7076156     10:64,415,184    A                26.52       8.74          Crohn’s disease
rs12571093    10:70,019,371    A                16.35       14.86         Optic disc size
rs1834481     11:112,023,827   G                21.50       0.35          Interleukin-18 levels
rs11175593    12:40,601,940    T                1.98        3.32          Crohn’s disease
rs75493593    17:6,945,087     T                1.85        12.06         Type-2 Diabetes
rs75418188    17:6,945,483     T                1.85        11.54         Type-2 Diabetes
rs117767867   17:6,946,330     T                1.85        11.54         Type-2 Diabetes

We identified alleles that are likely to have been introduced by Neanderthal gene flow (Supplementary Information section 7) and intersected these alleles with SNPs that have been shown to be associated with
phenotypes (from the NHGRI GWAS catalogue’? as well as from a recent GWAS for type 2 diabetes”°).
Extended Data Table 3 | Recall of the CRF as a function of the effective
population size

Effective population size   Recall
2500                        0.552 ± 0.009
5000                        0.506 ± 0.009
7500                        0.430 ± 0.006
10000                       0.384 ± 0.006

Recall is computed at a precision of 90%. Standard errors of the recall are estimated by a block jackknife
with 100 blocks.
Extended Data Table 4 | Unbiased estimate of the proportion of Neanderthal ancestry as a function of the B statistic

                            11 Non-Africans   11 Non-Africans   4 Europeans       7 Eastern
                            (transitions +    (transversions    (transversions    (transversions
                            transversions)    only)             only)             only)
                            Est.    Err.      Est.    Err.      Est.    Err.      Est.    Err.
Quintile 1: B=0-0.63        0.641   0.304     0.672   0.316     0.472   0.397     0.778   0.317
Quintile 2: B=0.63-0.80     0.825   0.209     0.779   0.234     0.849   0.290     0.750   0.236
Quintile 3: B=0.80-0.88     0.578   0.248     0.745   0.298     0.987   0.349     0.647   0.297
Quintile 4: B=0.88-0.94     0.684   0.184     0.676   0.208     0.446   0.256     0.771   0.221
Quintile 5: B=0.94-1.00     1.537   0.152     1.445   0.164     1.502   0.185     1.419   0.177
B ≥ 0.94 vs. B < 0.94       Z=3.82            Z=3.02            Z=3.12            Z=2.58
Correct for 10 hypotheses   P=0.00066         P=0.013           P=0.0090          P=0.049

Estimates of the proportion of Neanderthal ancestry in quintiles of B statistics divided by the genome-wide proportion. We find a significant excess of Neanderthal ancestry in the quintile with the highest B statistic
relative to the remaining four quintiles (significant after correcting for ten hypotheses).
Extended Data Table 5 | Recall of the CRF on the X chromosome
versus the autosomes

             Recall
Autosomes    0.384 ± 0.006
X            0.495 ± 0.009

Simulations are carried out using parameters tailored to be appropriate to each of these compartments
of the genome. Recall is computed at a precision of 90%. Standard errors of the recall are estimated by a
block jackknife with 100 blocks.
A dedicated circuit links direction-selective retinal
ganglion cells to the primary visual cortex 


Alberto Cruz-Martin, Rana N. El-Danaf, Fumitaka Osakada, Balaji Sriram, Onkar S. Dhande, Phong L. Nguyen,
Edward M. Callaway, Anirvan Ghosh & Andrew D. Huberman


How specific features in the environment are represented within 
the brain is an important unanswered question in neuroscience. A 
subset of retinal neurons, called direction-selective ganglion cells 
(DSGCs), are specialized for detecting motion along specific axes 
of the visual field’. Despite extensive study of the retinal circuitry 
that endows DSGCs with their unique tuning properties”’, their 
downstream circuitry in the brain and thus their contribution to 
visual processing has remained unclear. In mice, several different 
types of DSGCs connect to the dorsal lateral geniculate nucleus 
(dLGN)**, the visual thalamic structure that harbours cortical relay 
neurons. Whether direction-selective information computed at the 
level of the retina is routed to cortical circuits and integrated with 
other visual channels, however, is unknown. Here we show that there 
is a di-synaptic circuit linking DSGCs with the superficial layers of 
the primary visual cortex (V1) by using viral trans-synaptic circuit 
mapping”* and functional imaging of visually driven calcium sig- 
nals in thalamocortical axons. This circuit pools information from 
several types of DSGCs, converges in a specialized subdivision of the 
dLGN, and delivers direction-tuned and orientation-tuned signals 
to superficial V1. Notably, this circuit is anatomically segregated 
from the retino-geniculo-cortical pathway carrying non-direction- 
tuned visual information to deeper layers of V1, such as layer 4. Thus, 
the mouse harbours several functionally specialized, parallel retino- 
geniculo-cortical pathways, one of which originates with retinal DSGCs 
and delivers direction- and orientation-tuned information specif- 
ically to the superficial layers of the primary visual cortex. These 
data provide evidence that direction and orientation selectivity of 
some V1 neurons may be influenced by the activation of DSGCs. 

Visual perception involves the activity of neurons in the cerebral 
cortex. The most direct route for visual information to reach the cortex 
is via the ‘retino-geniculo-cortical pathway’ consisting of retinal gan- 
glion cells (RGCs), relay cells in the dLGN and neurons in primary 
visual cortex (V1) (Extended Data Fig. 1a–c)’. Recently, we and others
discovered that direction-selective retinal ganglion cells (DSGCs) project 
to the dLGN and therein target a specific layer in the lateral ‘shell’**"® 
(Fig. 1a–c). Hereafter we also refer to this layer as the DSGC-recipient
zone or ‘DSGC-RZ’ (Fig. 1d).

Previous work showed that the dLGN shell receives input from the 
superior colliculus’ and thus, like other thalamic compartments"’, neu- 
rons in the shell/DSGC-RZ may restrict their connections to subcor- 
tical networks, rather than participating in the retino-geniculo-cortical 
pathway. We infected neurons in the dLGN shell and a small portion of 
the dLGN core by injections of adeno-associated virus (AAV)-tdTomato 
(Fig. 1e). Within V1, tdTomato+ axons were observed in deeper layers
4 and 6 and superficial layers 1 and 2 (Fig. 1f-i). Thus, neurons in the 
shell/DSGC-RZ probably include thalamocortical relay neurons, but it 
was unclear whether they target specific V1 layers. 

To determine whether there is laminar specificity of mouse geniculo- 
cortical connections we injected retrograde tracers into different V1 


layers (Fig. 2a–c) and analysed the position of the retrogradely labelled
neurons in the dLGN (Fig. 2d–i) (Extended Data Fig. 2a–d). Injections
of all V1 layers retrogradely labelled cells across the full width of the 
dLGN (Fig. 2a, d, g). By contrast, injections directed to V1 layer 4 
preferentially labelled neurons in the dLGN core (Fig. 2b, e, h) and 
injections into superficial V1 layers 1 and 2 preferentially labelled 
neurons in the dLGN shell/DSGC-RZ (Fig. 2c, f, i) (Extended Data 
Fig. 3a-c). These laminar-specific patterns of retrograde labelling were 
independent of retinotopy or eye-specific connectivity (Extended Data 
Fig. 4) and together they indicate that cells in the dLGN core project 
to deeper V1, whereas cells in the dLGN shell/DSGC-RZ preferentially 
target superficial V1 (Extended Data Fig. 3d). 

RGC axons form synapses onto the somas and dendrites of dLGN
neurons”, the latter of which are not entirely labelled using traditional 
retrograde tracing methods. Thus, we extended our exploration of the 
connections between DSGCs and thalamocortical relay neurons by 
using a glycoprotein-deleted rabies virus expressing mCherry
(ΔG-RABV-mCherry) that infects neurons at the level of their axon
terminals, leading to Golgi-like expression of mCherry throughout the
infected cell. We note, however, that by itself ΔG-RABV-mCherry
does not pass trans-synaptically”*. Following injections of
ΔG-RABV-mCherry into superficial V1 (Fig. 3a, b) we observed relay neurons in
the dLGN shell expressing mCherry throughout their somas and dend- 
ritic arborizations (Fig. 3c, d). By performing these experiments in 
mice with genetically tagged DSGCs, we determined that the majority
of the mCherry+ cells resided in the DSGC-RZ (Fig. 3e, f) (Extended
Data Fig. 4) (Methods). We observed putative sites of contact between
the axon terminals of DSGCs and the dendrites of mCherry+ dLGN
neurons (Fig. 3g), some of which contained VGLUT2 (Fig. 3h–k)
(Extended Data Fig. 5), a presynaptic glutamate transporter that, in
the dLGN, arises solely from RGC axon terminals'*. Together, these
experiments indicate that the neurons in the dLGN shell/DSGC-RZ
that project to superficial V1 are contacted by, and probably receive
synaptic input from, On-Off DSGCs.

Figure 1 | The layer of the dLGN that receives input from DSGCs projects to
V1. a, GFP+ DSGC axons in dLGN. OT, optic tract. Inset, GFP+ boutons.
d, dorsal; m, medial. b, Merged GFP/CTB-594 (all RGC axons) and c, with
DAPI (4′,6-diamidino-2-phenylindole; cell nuclei). a–c, Scale bar, 125 μm.
d, Summary. e, AAV2-tdTomato injection to dLGN. Asterisk, tdTomato+
neurons in core. Scale bar, 100 μm. f, tdTomato+ thalamocortical axons in V1.
L1–L6, layers 1–6; wm, white matter. Scale bar, 150 μm. g–i, Higher magnification
of L4 (g), L5 and L6 (h) and L1 and L2 (i). Arrows, axons in L1. Scale bars,
g, h, 50 μm; i, 100 μm. Sixteen mice were used for these experiments.

Figure 2 | Parallel, layer-specific thalamocortical circuits in the mouse.
a–c, Tracer injections to all V1 layers (a), layer 4 (b) and superficial V1 (c).
d–f, dLGN neurons labelled after full depth injection (d; asterisk, axons of
L6 neurons; yellow box, neurons in shell; green box, middle; blue box, core),
deep V1 injection (e) and superficial V1 injection (f; arrows, labelled cells;
arrowhead, labelled cell outside shell). g–i, Position of retrogradely labelled cells
along dLGN width. g, Full depth, 35.54 ± 1.62% (4 mice; n = 232 cells);
h, Deep, 68.53 ± 1.22% (4 mice; n = 69 cells); i, Superficial, 14.68 ± 1.57%
(4 mice; n = 78 cells). Superficial versus deep: ***P < 0.0001; Tukey’s
multiple comparisons. c, Scale bar, 200 μm. f, Scale bar, 100 μm.

To determine whether there is a bona fide di-synaptic circuit linking 
DSGCs to superficial V1, we used rabies-virus trans-synaptic network 
tracing”®. We injected ΔG-RABV-mCherry into superficial V1 to infect
neurons in the dLGN shell/DSGC-RZ via their presynaptic terminals
and, in the same mice, we infected dLGN neurons with an AAV expressing
the rabies glycoprotein and histone-tagged green fluorescent protein
(AAV2-Glyco-hGFP) (Fig. 4a). In this experimental configuration, the
double-infected RABV-mCherry+/Glyco-GFP+ dLGN relay neurons produce
infectious ΔG-RABV-mCherry that propagates trans-synaptically
to infect and label the RGCs that form synapses with them (Fig. 4a) 
(Methods). A small number of double-infected cells were present in the 
dLGN and these were always located in the shell/DSGC-RZ (Fig. 4b-e) 
(8 mice, n = 21 cells). Moreover, we performed these experiments in 
mice where GFP is selectively expressed by posterior-tuned On-Off 
DSGCs** (Extended Data Fig. 6) and we used immunohistochemical 
markers that recognize multiple On-Off DSGC types'*”* to determine 
(1) whether DSGCs provide di-synaptic input to V1, and (2) if so, 
whether multiple DSGC types feed this pathway. 
Figure 3 | DSGC axons contact thalamic relay neurons projecting to
superficial V1. a, ΔG-RABV-mCherry injection to superficial V1, to infect
axons of DSGC-RZ neurons. b, V1 injection. Asterisk, infected L5/6 neurons.
Scale bar, 200 μm. c, d, mCherry+ dLGN neurons. c, Arrowhead, axon. Scale
bars, 75 μm. d, Arrow, proximal dendrites; arrowheads, distal dendrites.
Scale bars, 25 μm. e, Percentage of mCherry+ somas within GFP+ DSGC-RZ
(74.77 ± 11.62%; 8 mice, n = 83 cells) (Extended Data Fig. 4). f-h, DSGC axons
and dLGN somas and dendrites. g, Magnified view of frame in f. Arrowheads,
putative contact sites'*. Scale bars: f, 125 μm; g, 20 μm. h–k, GFP, mCherry,
VGLUT2 (per cent mCherry signal contacted by GFP+/VGLUT2+
profiles = 5.23 ± 1.39%; 4 mice; n = 4 cells). k, Scale bar, 2 μm.
In mice with one to three double-infected dLGN neurons we observed
1–3 mCherry+ RGCs, which is consistent with the convergence of mouse
retinogeniculate connections’”. An example of a trans-synaptically labelled
RGC is shown in Fig. 4f, g. The cell has thin dendrites and looping 
arborizations and is bistratified in the On and Off sublayers of the inner 
plexiform layer (IPL) (Fig. 4g, and inset), features characteristic of 
On-Off DSGCs***°. Also, the RGC is GFP+ in a transgenic mouse
where GFP is selectively expressed in posterior-tuned On-Off DSGCs°
(Fig. 4h-j) (Extended Data Fig. 7). Another mCherry+ DSGC trans-synaptically
labelled from superficial V1 is displayed in Fig. 4k-m. The
cell expresses cocaine- and amphetamine-regulated transcript (Cart), a
marker of On-Off DSGCs'*"*. Interestingly, trans-synaptic tracing from
superficial V1 also labelled J-RGCs, which are asymmetric Off-type
DSGCs tuned for upward motion’ (Fig. 4n, o).

Of all the RGCs trans-synaptically labelled from superficial V1 we 
verified that the vast majority were DSGCs (n = 27/28 RGCs; 8 mice). 
17/28 were genetically identified as posterior-tuned On-Off DSGCs°*
and 9/28 were Cart+ but not GFP+ and thus are On-Off DSGCs which
are probably tuned to other cardinal axes of motion’*'®. 1/28 was an
upward tuned J-RGC’, and one could not be classified. Thus, although 
we cannot conclude that only DSGCs contribute to this pathway, we 
find that several types of On-Off DSGCs as well as Off-type DSGCs 
provide di-synaptic input to superficial V1. 

Do DSGCs also feed the classic retino-geniculo-cortical pathway into 
layer 4? We addressed this by injecting mice with ΔG-RABV-mCherry
into V1 layer 4 and AAV2-Glyco-hGFP into the dLGN, then examining
the RGC types trans-synaptically labelled with mCherry (Fig. 4p).
In this regime, the mCherry+/Glyco-hGFP+ double-infected neurons
resided in the dLGN core (Fig. 4q-t) (Extended Data Fig. 8) (7 mice,
n = 53 cells) (Fig. 4t versus Fig. 4e; ***P < 0.0001; two-tailed t-test).
All the RGCs trans-synaptically labelled from layer 4 had large somas
and broad, smooth monostratified dendrites, features characteristic of
mouse alpha RGCs’*"? (Fig. 4u, v) and/or they expressed SMI-32, a
marker of alpha RGCs"* (Fig. 4w-y) (7 mice, n = 38 RGCs) that is not
expressed by On-Off DSGCs (Fig. 4z-bb). Moreover, none of the RGCs
trans-synaptically infected from layer 4 exhibited DSGC morphologies,
stratification patterns or molecular/genetic markers**’*’* (Fig. 4cc-ee)
(0/38 RGCs examined; 7 mice). Combined with the results in Figs 2, 3,
4a-o, these data indicate that alpha, non-direction-tuned RGCs are 


20 MARCH 2014 | VOL 507 | NATURE | 359 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


[Figure 4 image. Panel labels: ΔG-RABV-mCherry/AAV2-Glyco-hGFP/DAPI; Glyco-hGFP; e, mCherry/Glyco-hGFP (%) plotted against dLGN width (%); f, ΔG-RABV-mCherry; g, Reconstruction (Off sublayer; On sublayer and soma).]


Figure 4 | Synaptic circuit linking DSGCs to superficial V1, and non-DSGCs
to L4. a, Trans-synaptic tracing. b, Infected dLGN neurons. Arrow and
arrowhead: double-infected cells; arrow is same cell as in c, d. Scale bar, 100 μm.
c, d, Cell from b. Scale bar, 15 μm. Dashed line, lateral border. e, Distribution of
double-infected dLGN cells = 9.29 ± 1.82% (8 mice, n = 21 cells). f, On-Off
DSGC trans-synaptically labelled from superficial V1. g, On (red) and Off
(black) dendrites. Arrowhead, axon. Scale bar, 50 μm. h-j, Cell (f) is GFP+ On-Off
DSGC*. Scale bar, 10 μm. k-m, Trans-synaptically labelled GFP* and
Cart DSGC' shown at low (left) and high (right) magnifications. m, Scale bar
left, 75 μm; right, 10 μm. n, Trans-synaptically labelled J-RGC’. o, Off dendrites
(black). Scale, 50 μm. p, Same as a, but layer 4 injection. q-s, Infected neurons in
core; q, arrow, arrowhead: double-infected cells. Scale, 100 μm. r, s, Cell from
q (arrow). s, Scale bar, 15 μm. t, Distribution of double-infected dLGN
cells = 70 ± 2.65% (7 mice, n = 53 cells) (P < 0.0001 versus e; two-tailed
t-test). u, v, Alpha RGC labelled from V1 layer 4. u, Scale bar, 100 μm;
side view, 50 μm. w-y, mCherry+ RGC, same as from u, v, is SMI-32+. Scale bar,
20 μm. z-bb, SMI-32 and Cart. Scale bar, 25 μm. cc-ee, ΔG-RABV-mCherry+
alpha RGC; lacks GFP® and Cart. Scale bar, 150 μm.


poised to influence layer 4 of V1 via neurons in the dLGN core, whereas 
DSGCs are poised to influence superficial V1 via neurons in the dLGN
shell.

Next we asked what qualities of visual information are delivered by 
the dLGN to superficial V1. We forced a subset of dLGN neurons to 
express the calcium indicator GCaMP6” by injections of AAV2-Syn-GCaMP6
into the shell/DSGC-RZ (Fig. 5a-c) (5 mice). We then imaged
the visually evoked calcium dynamics (ΔF/F) in thalamocortical axons
that target superficial layers of V1 (Fig. 5d) using in vivo time-lapse
two-photon microscopy” (Fig. 5e), while presenting the mice with
drifting gratings of different orientations and directions (Fig. 5e-g) 
(Methods). 


Figure 5 | In vivo imaging of visually evoked Ca2+ signals in
thalamocortical axons. a, AAV2-GCaMP6 injection to dLGN shell.
b, GCaMP6+ neurons (arrows). Scale bar, 50 μm; inset, 10 μm. c, GCaMP6+
dLGN axons, superficial V1. Arrows: varicosities. Scale, 50 μm. d, GCaMP6+
axons, superficial V1. Circles, square in d correspond to polar plots l, o, p.
Scale bar (d), 5 μm. e, In vivo imaging/visual stimulation. f, Visually evoked
Ca2+ signal in thalamocortical axon (top trace: photodiode signal; bottom
trace: ΔF/F). g, Directional stimuli (0°, 45°, 90°, 135°, 180°, 225°, 270°, 315°).
h-j, Direction- (h, i) and orientation-tuned (j) varicosities. 5-8 trial average.
k-s, Polar plots of F1 (red) or F2 (black) magnitude responses (Methods). Inner
solid ring, average response to mean grey stimulus. Shaded, 3 standard deviations
greater than the mean response to grey stimuli. Lower right of each plot, OSI/DSI.
Upper right, Fourier amplitudes. t, DSI/OSI, all varicosities (5 mice, n = 58
varicosities). Mean ± s.e.m. u, Cumulative distributions: OSI (circles), DSI (squares).

Figure 5f shows an example of visually evoked calcium transients
in a thalamocortical axon. Figure 5h, i shows two clear and marked
examples of direction-tuned signals in dLGN axons located within
superficial V1. The first example (Fig. 5h) responds only to gratings
drifting at 45 degrees (polar plot Fig. 5k). The second example (Fig. 5i)
is also direction-selective but is more broadly tuned (polar plot Fig. 5l).
Interestingly, we also observed thalamocortical axons that were strongly
tuned not for direction, but instead for orientation (Fig. 5j and Fig. 5n-q).
Overall, a variety of strengths of direction and orientation tuning were
observed, including some thalamocortical axons that were not tuned
for either feature (Fig. 5r, s). Approximately 60% of all thalamocortical
axons imaged had direction selectivity indices (DSIs) ≥ 0.4, a commonly
used threshold for categorizing mouse DSGCs”! and approximately
85% had orientation selectivity indices (OSIs) ≥ 0.5 (Fig. 5t, u)
(5 mice, n = 58 varicosities). Thus, the majority of visual information 
delivered to superficial V1 by neurons in the dLGN shell/DSGC-RZ is 
direction- and/or orientation-tuned. 

Our results are the first to define a circuit relationship between DSGCs 
and visual cortex and they suggest that, in the mouse, direction-tuned 
and orientation-tuned V1 neurons may inherit their characteristic 
receptive field properties from DSGCs. Notably, such influence may 
be specific to neurons receiving thalamic excitation within superficial 
V1. Indeed, our results of trans-synaptic labelling of RGCs from V1 
layer 4, combined with recent studies that recorded thalamic excitation 
in layer 4 (refs 22, 23), suggest that orientation and direction selectivity 
of neurons in deeper V1 arises from the convergence of retinal and 
thalamic afferents with centre-surround receptive fields, as opposed to 
direction- or orientation-tuned receptive fields. 

Interestingly, we discovered that several different types of On-Off 
DSGCs as well as upward-tuned Off-DSGCs feed the retino-geniculo- 
superficial V1 circuit. We also found that some thalamocortical axons 
are tuned for specific axes of motion, whereas others are orientation- 
tuned**”. Thus, the tuning of V1 neurons that receive input from 
neurons in the shell/DSGC-RZ may reflect the integration of indi- 
vidual or multiple DSGC types, as well as intra-cortical circuitry****. 
Together, our findings suggest that directional motion encoded at the 
level of the retina may influence the direction and orientation select- 
ivity of some V1 neurons. 


METHODS SUMMARY 


All experimental protocols were conducted according to the National Institutes of 
Health (NIH) guidelines for animal research and approved by the Institutional 
Animal Care and Use Committee (IACUC) at the University of California, San 
Diego, USA. 

METHODS


All experimental protocols were conducted according to the National Institutes of 
Health (NIH) guidelines for animal research and were approved by the Institutional 
Animal Care and Use Committee (IACUC) at the University of California, San 
Diego, USA. 

Mice. Pigmented, wild-type, DRD4-GFP°, Trhr-GFP® mice of both sexes (40 to 60 
days old) were used. Animals from different litters and parent mice were used for 
each experiment. Sample sizes (number of mice and/or cells) for each experiment 
are stated in main text and figure legends. 

Retrograde tracer injection. Mice were anaesthetized using an isoflurane-oxygen
mixture (4.0% (v/v) for induction and 1.5% (v/v) for maintenance) and given the
analgesic buprenorphine (SC, 0.3 mg kg−1). After a midline scalp incision, a small
burr hole was made with a pneumatic dental drill (Henry Schein) over V1. For
tracer injections into cortex we used either red fluorescent-latex beads (Lumafluor)
that travel strictly in retrograde fashion from axon terminals at the injection site to
cell bodies of origin”, or cholera toxin beta (CTB-594; Invitrogen), which travels
in both the anterograde and retrograde directions. Tracers (200 nl volume) were
injected with a Nanoject II (Drummond) through a small glass pipette (~0.5 MΩ).
No distinguishable differences were observable in dLGN-labelling patterns between the two
retrograde tracers, except that full-depth injections with CTB-594 also labelled L6
axons (for example, Fig. 2a, d). All injections were targeted stereotaxically to V1
using the following coordinates: 0.2-0.5 mm anterior to Lambda and 2.5 mm lateral
from midline. For superficial injections we penetrated the cortex to a final depth
of 150 μm and for deep injections to a depth of 650 μm. Full-depth experiments
were accomplished by injecting tracers at multiple cortical depths.

Virus injections. For injections of AAV2 expressing histone-tagged GFP and
rabies glycoprotein (AAV2-Glyco-hGFP, titre, 10'* genomic titre per ml) into the
dLGN, a small burr hole was made using the following stereotaxic coordinates:
2.10 mm posterior to Bregma and 2.2 mm lateral from midline. A small glass pipette
(~0.5 MΩ) was lowered 2.50 mm from the pial surface and, using a Nanoject II
(Drummond), 0.04-0.40 μl of virus was inoculated into the brain. A 3-week waiting
period was introduced post-AAV2-Glyco-hGFP injection in order to allow
maximal expression, after which the scalp was re-incised and a second burr hole
was made over V1 for injections of ΔG-RABV-mCherry. It was crucial to inject
AAV2-Glyco-hGFP before ΔG-RABV-mCherry because AAVs require more time
for expression to occur. ΔG-RABV expressed within 5-7 days and only dLGN
neurons expressing both the ΔG-RABV and Glyco-hGFP are competent to infect
RGCs that synapse with them. For the cortical injections of ΔG-RABV-mCherry, a
pipette (~0.5 MΩ) was lowered to a final depth of either 150 μm (‘superficial’)
or 450 μm (‘deep’), and 0.04-0.40 μl of glycoprotein-deleted modified rabies
virus encoding mCherry (ΔG-RABV-mCherry, titre, 8.0 × 10°-2.0 × 10° infectious
units per ml) was inoculated. After surgeries the skin was sutured and the
animals were allowed to recover in a heated chamber. For ΔG-RABV-mCherry
infections, mice were housed in a biosafety room for 5-7 days to allow rabies virus
to infect, trans-synaptically spread, and express mCherry in presynaptic cells.
Virus production. The coding sequence of histone 2B-tagged GFP and rabies 
virus glycoprotein B19G linked by the F2A element was amplified by PCR with 
high-fidelity Phusion polymerase*. CMV enhancer and synapsin promoter were 
cloned by PCR”. These PCR fragments were inserted into the AAV vector con- 
taining woodchuck hepatitis virus posttranscriptional regulatory element. Every 
plasmid was sequenced before virus production. AAV were generated with trans- 
fection of HEK293t cells, purified by iodixanol gradient centrifugation and titred 
in HEK293t cells with quantitative PCR. The titre of the AAV used in this study 
was 10'* genomic titre per ml (ref. 31). ΔG-RABV expressing mCherry was amplified
in B7GG cells, concentrated by two rounds of centrifugation, and titred in
HEK293t cells as described previously*. The titres of the rabies viruses were
8.0 × 10°-2.0 × 10° infectious units per ml. The viruses were stored at −80 °C until use.
Retinal histology. Five to seven days after ΔG-RABV-mCherry injection, whole-mount
retinas were harvested and fixed in 4% paraformaldehyde (PFA) for
30 min-1 h, then washed in phosphate buffered saline (PBS; 3×, 30 min). The
retinas were left for 2 h in blocking solution (10% goat serum, 0.25% Triton-X) at
room temperature, then transferred to primary antibodies for 12-18 h at 4 °C, and
washed in PBS (3×, 30 min). The retinas were incubated in secondary antibody
for 2 h, and washed in PBS (3×, 30 min). Then, retinas were mounted and
coverslipped with either Vectashield containing DAPI (Vector Laboratories) or Prolong
Gold with DAPI (Invitrogen). Depending on the experiment, the following primary
antibodies were used (1:1,000 guinea-pig anti-GFP, Synaptic Systems; 1:1,000
rabbit anti-Cart, Phoenix Pharmaceuticals; 1:1,000 mouse anti-SMI-32, Covance;
and 1:1,000 rabbit anti-DSRed, Clontech). The following secondary antibodies were
used (1:1,000 goat anti-guinea-pig Alexa Fluor 488; 1:1,000 goat anti-rabbit Alexa 
Fluor 647; 1:1,000 goat anti-rabbit Alexa Fluor 594; 1:1,000 goat anti-mouse Alexa 
Fluor 488; Invitrogen). 


Retinal physiology. Trhr-GFP mice (6-8 weeks old) were used for targeted recordings
of GFP+ DSGCs using methods described previously”. GFP RGCs were
identified using two-photon microscopy to minimize photopigment bleaching. 
Loose patch recording was used to measure spike responses to moving bar stimuli 
projected onto the photoreceptor layer from an OLED array (eMagin) with mean 
light levels of approximately 70 rhodopsin isomerizations per rod per second and 
bar contrasts of 200-300%. 

Brain histology. Following retrograde or trans-synaptic tracer injection, mice were 
deeply anaesthetized with pentobarbital and transcardially perfused with 0.9% 
saline followed by 4% PFA. Brains were removed, fixed overnight in PFA and then 
transferred to a 30% (w/v) sucrose solution and stored at 4°C. Brain slices were 
collected at 40-60 μm thickness in the coronal plane using a sliding microtome.
Brain sections containing dLGN and visual cortex were washed with PBS and incubated
for 1 h at room temperature in a blocking solution (10% goat serum, 0.25%
Triton-X). This was followed by an overnight incubation with rabbit anti-GFP
(1:1,000, Invitrogen) and guinea-pig anti-VGLUT2 (1:1,000, Millipore) primary
antibodies in blocking solution at room temperature. Sections were then rinsed
with PBS (3×, 30 min), and incubated for 2 h at room temperature with Alexa
Fluor 488 goat anti-rabbit (1:1,000, Invitrogen) and Alexa Fluor 647 goat anti-guinea-pig
(1:1,000, Invitrogen) secondary antibodies. Lastly, sections were washed
with PBS and mounted with Vectashield with DAPI (Vector Laboratories).

Targeted filling of genetically tagged RGCs. Targeted RGC filling was conducted
using methods described previously’. Retinas from mice aged
postnatal day 25-30 (P25-30) were dissected and kept in an oxygenated (95%
O2/5% CO2) solution of Ames’ medium (Sigma-Aldrich) supplemented with 23 mM
NaHCO3. Borosilicate glass sharp electrodes were used to fill GFP+ RGCs with a
10 mM solution of Alexa 555 hydrazide (Invitrogen) in 200 mM KCl, with the
application of hyperpolarizing current pulses ranging between 0.1 and 0.9 nA for
5-20 min. After cell filling, the retinas were fixed for 1 h with 4% PFA and were
processed for immunocytochemistry using the methods described above. The fol- 
lowing primary and secondary antibodies were used: rabbit anti-GFP (1:1,000, 
Invitrogen), guinea-pig anti-VAChT (1:1,000, Millipore), Alexa Fluor 488 goat 
anti-rabbit and Alexa Fluor 647 goat anti-guinea-pig (1:1,000, Invitrogen). Sections 
were rinsed with PBS and mounted with Prolong Gold with DAPI (Invitrogen). 
Confocal imaging. Fluorescence images were acquired on a laser scanning con- 
focal microscope (Zeiss LSM 710 and 780) equipped with 405, 488, 561 and 
633 nm laser lines. Image stacks with a scanning resolution of 1,024 × 1,024 pixels
were collected using the following objective lenses: ×10/0.45 Plan-Apochromat
(step size of 3.7-4.0 μm), ×20/0.8 Plan-Apochromat (step size of 1.0 μm) and LD
C-Apochromat ×40/1.1 water immersion (step size of 0.5-1 μm).
Quantification. For calculating cortical intensity profiles for superficial and deep 
injections of tracers, the location of the injection was identified from epifluorescent 
images of tissue sections taken from the entire brain. We evaluated tracer spread 
along the length of the injection site (from pia to white matter) and the width 
(parallel to the pial surface). Background fluorescence was subtracted using the 
contralateral non-injected cortex as the baseline control. After background subtraction,
the peak fluorescence along the injection site was quantified by dividing
the site into equidistant areas along the injection height and width from pia to the
white matter (~800-1,000 μm). For each fluorescence intensity profile, the peak
was identified as maximum fluorescence. The fluorescence signal was normalized
to this value. Four to five intensity profiles were analysed per tissue section. All
tissue sections surrounding the injection site were included in the analysis. We
confirmed superficial injections as those with peaks ≤ 150 μm from the pia and
deeper injections as those with peaks >300 μm from the pia.
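
The normalization and depth criteria described above can be illustrated with a minimal sketch. It assumes each profile is a list of fluorescence values sampled at known depths, and that the contralateral baseline is available at the same depths; the function names are hypothetical and this is not the authors' MATLAB routine.

```python
def normalize_profile(raw, background):
    """Subtract the baseline and express the profile as % of its own peak."""
    subtracted = [max(r - b, 0.0) for r, b in zip(raw, background)]
    peak = max(subtracted)
    if peak == 0:
        raise ValueError("profile has no signal above background")
    return [100.0 * s / peak for s in subtracted]


def peak_depth_um(profile, depths_um):
    """Depth (in micrometres from the pia) at which the profile is maximal."""
    index = max(range(len(profile)), key=profile.__getitem__)
    return depths_um[index]


def classify_injection(peak_um):
    """Apply the stated criteria: peak <= 150 um superficial, > 300 um deep."""
    if peak_um <= 150:
        return "superficial"
    if peak_um > 300:
        return "deep"
    return "unclassified"  # assumption: the text does not name this range


depths = [0, 50, 100, 150, 200, 250, 300, 350, 400]
raw = [10, 30, 50, 90, 40, 20, 10, 10, 10]
baseline = [10] * len(raw)
profile = normalize_profile(raw, baseline)
label = classify_injection(peak_depth_um(profile, depths))
```

The "unclassified" label for peaks between 150 and 300 μm is an assumption made only so that the function is total.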

For measuring the location of retrogradely labelled cells along the medial-lateral
axis of dLGN, we obtained fluorescent confocal images (×10 and ×20) of coronal
brain tissue sections. The boundaries of the dLGN were identified using two stable
features: the optic tract, which defines the lateral dLGN border, and the intralaminar
zone between the dLGN and the latero-posterior nucleus, which defines the
dLGN’s medial border. Four measurements were taken per tissue section from
locations that spanned the entire rostral-caudal length of the dLGN and thus, the 
entire retinotopic map. These measurements were placed at equidistant points 
along the width of the dLGN and an average distance of each labelled cell along the 
axis from the optic tract (0%) to the medial border (100%) was obtained. 

To quantify overlap between retrogradely infected dLGN relay neurons and
GFP+ DSGC axons we obtained fluorescent confocal stacks of coronal tissue
sections through the dLGN. For the GFP channel a median filter (2 pixels) was
applied and the background was then subtracted using a rolling ball radius
algorithm (50 pixels). After processing images, maximum intensity projections were generated
for both channels and the GFP channel was subsequently thresholded using a
triangular algorithm™. The thresholded image was then binarized to create a mask
representing the complete area populated by GFP+ DSGC axons. This mask was
superimposed on the red channel. Every mCherry+ cell body was identified and its
location assessed for overlap with the DSGC-RZ. We note that modified rabies
viruses may underestimate total numbers of presynaptic inputs to double-infected
cells.
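
The thresholding and mask-overlap steps can be sketched as follows. This is a simplified, assumed version of the triangle method operating on an integer histogram (ImageJ's implementation differs in detail), with the median filtering and rolling-ball subtraction taken as already done; soma positions are given as hypothetical (row, column) centroids.

```python
def triangle_threshold(hist):
    """Bin index with the greatest distance from the peak-to-tail line."""
    nonzero = [i for i, h in enumerate(hist) if h > 0]
    if not nonzero:
        raise ValueError("empty histogram")
    lo, hi = nonzero[0], nonzero[-1]
    peak = max(range(len(hist)), key=hist.__getitem__)
    # Work along the longer tail of the histogram.
    end = hi if (hi - peak) >= (peak - lo) else lo
    if end == peak:
        return peak
    x1, y1, x2, y2 = peak, hist[peak], end, hist[end]
    step = 1 if end > peak else -1
    best_index, best_distance = peak, -1.0
    for i in range(peak, end + step, step):
        # Unnormalized point-to-line distance; the scale factor is constant.
        d = abs((y2 - y1) * i - (x2 - x1) * hist[i] + x2 * y1 - y2 * x1)
        if d > best_distance:
            best_index, best_distance = i, d
    return best_index


def binarize(image, threshold):
    """Mask of pixels brighter than the threshold (the GFP+ axon area)."""
    return [[value > threshold for value in row] for row in image]


def percent_in_mask(soma_positions, mask):
    """Percentage of soma centroids that fall inside the mask."""
    inside = sum(1 for r, c in soma_positions if mask[r][c])
    return 100.0 * inside / len(soma_positions)


threshold_bin = triangle_threshold([50, 30, 10, 4, 2, 1, 1, 1])
mask = binarize([[0, 5, 9], [0, 6, 1], [0, 0, 0]], 4)
overlap = percent_in_mask([(0, 1), (1, 1), (2, 2), (0, 2)], mask)
```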

To determine the area of contact between dLGN neurons and VGLUT2 con- 
taining DSGC axons, we quantified the percentage of the total somatodendritic 
area of the mCherry expressing neuron that co-localized with GFP signal from the 
genetically tagged On-Off DSGC, and that were also immunoreactive for VGLUT2. 
First, GFP and VGLUT2 puncta were identified by a series of filtering (median = 2 
pixels) and erosion steps. This image was then thresholded to identify regions of 
interest (ROIs) that contained VGLUT2 and GFP puncta, and this was divided by
the total measurement of the mCherry labelled area. Images were processed and 
analysed using algorithms in ImageJ (NIH). 
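
As an illustration of the contact measure, the sketch below computes the percentage of mCherry-labelled pixels that are also positive for both GFP and VGLUT2. It assumes three already-thresholded binary masks of equal size; the function name is hypothetical and the filtering and erosion steps are not reproduced.

```python
def percent_contact(mcherry_mask, gfp_mask, vglut2_mask):
    """% of the mCherry area co-localized with GFP and VGLUT2 signal."""
    total = 0
    contacted = 0
    for row_m, row_g, row_v in zip(mcherry_mask, gfp_mask, vglut2_mask):
        for m, g, v in zip(row_m, row_g, row_v):
            if m:
                total += 1
                if g and v:
                    contacted += 1
    if total == 0:
        raise ValueError("mCherry mask is empty")
    return 100.0 * contacted / total


contact = percent_contact(
    [[1, 1], [1, 1]],  # mCherry somatodendritic area
    [[1, 0], [1, 1]],  # GFP puncta
    [[1, 1], [0, 0]],  # VGLUT2 puncta
)
```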

For retinal co-localization experiments, the maximum intensity projections for 
confocal stacks were obtained for red (mCherry), green (GFP) and far-red channel 
(Cart). Confocal sections were background subtracted and a maximum projection 
of the stack was obtained. Channels were merged and markers were considered co- 
localized if signals from different channels coincided within the same plane. 
Statistical analysis. Analysis of fluorescent intensity profiles, normalized distances 
across the cortex and dLGN and axonal overlap of labelled relay cells and axons 
was carried out using custom routines in MATLAB (MathWorks) and ImageJ
(NIH). All statistical analyses were performed with GraphPad Prism (GraphPad 
Software), and represented as averages. Error bars in graphs represent the s.e.m., 
where n = number of cells. P value of <0.05 was used to determine significance. All 
statistical tests were judged as appropriate and described throughout the manu- 
script. P values for post hoc tests involving multiple comparisons were adjusted 
using the Tukey’s method. No randomization method was used to assign mice to 
the experimental groups and investigators were blinded to the original group 
allocation or when assessing an outcome. Variance and normality were tested to 
ensure that a suitable statistical test was chosen. The variance was similar between 
groups that were statistically compared. Normality testing in groups with a small 
number of samples was done with quantile-quantile plots. 

For anatomical tracing experiments sample sizes were determined based on
preliminary experiments that tested the labelling/infection efficiency of retrograde 
tracers and/or viral trans-synaptic tools. For imaging experiments no statistical 
test was used to determine sample size a priori. Experiments were included in the 
study if they were deemed successful. For the trans-synaptic studies, the experiment 
was discarded if there were no rabies infected RGCs as assessed by fluorescence. 
For experiments involving two-photon imaging of axons, data belonging to non- 
visually responsive axons was not included in the experiment. In addition, mice in 
which we found no visually driven axons were also excluded from the study. 
Imaging of visually evoked calcium signals in DSGC-RZ thalamocortical axons. 
For monitoring the activity of LGN axons in V1 either the slow (GCaMP6S, titre,
3.04 × 10° genomic titre per ml) or the fast (GCaMP6F, titre, 2.96 × 10% genomic
titre per ml) version of the genetically encoded Ca2+ sensor GCaMP6
(AAV1.Syn.GCaMP6.WPRE.SV40)”° was injected into the dLGN using the following coordinates:
2.10 mm posterior to Bregma and 2.2 mm lateral from midline. A small
glass pipette (~0.5 MΩ) was then lowered 2.30 mm from the pial surface and, using
a Nanoject II (Drummond), 0.04-0.20 μl of virus was inoculated into the brain.

Three weeks after the injection of the calcium indicator, mice were deeply anaesthetized
using an isoflurane-oxygen mixture (4% (v/v) for induction and 1.5% (v/v)
for maintenance) and placed in a stereotaxic frame. Following midline incision, a
2 mm diameter craniotomy (1 mm anterior to lambda, 2.5 mm lateral to midline)
was performed in the left contralateral primary visual cortex of adult (two month
old) C57/Bl6 mice. Next, a sterile 3 mm glass coverslip was gently laid over the dura
mater (without using agarose) and glued to the skull with cyanoacrylate glue. Black 
dental acrylic (Lang Dental) was then applied throughout the skull surface up to 
the edges of the coverslip and the skin. A custom-made titanium chamber (dia- 
meter: 16 mm, thickness: 1.3 mm) was also embedded in the dental acrylic to help 
secure the mouse onto the stage of the microscope and shield the microscope 
objective and the photomultipliers from stray light. 

Image acquisition and presentation of stimulus grating. Mice were transferred 
and head-fixed to a stage while under isoflurane anaesthesia and kept at 37 °C using 
a temperature control device (Harvard Apparatus). Anaesthesia was maintained at 
0.8-1.0% (v/v). Imaging was performed using a two-photon microscope system 
(Sutter) controlled by ScanImage software*’ written in MATLAB (MathWorks). 
Excitation light from a Mai Tai DeepSee laser (Newport Corp.) with group delay 
dispersion compensation was scanned by galvanometers (Cambridge Technologies). 
Images were collected through a ×20 objective (1.0 NA, Olympus) and the laser
power was typically maintained well below 50 mW at the sample. GCaMP6 was
excited at 910 nm and emission was collected with a green filter (535 nm centre;
50 nm band; Chroma) via photomultiplier tubes (PMT) (H7422PA-40, Hamamatsu
Photonics). Images were collected (×3 digital zoom) at 8 Hz (256 × 128 pixels) or
16 Hz (256 × 64 pixels). For each experiment the optical axis was adjusted to be
perpendicular to the cranial window. Imaging was stopped between trials and
during this time slow drifts in the optical focus were corrected using a template
image. Bleaching of GCaMP6 was not evident during experiments. Because of the
limitation of current genetically encoded Ca2+ sensors** our experiments were not
sensitive enough to detect single action potentials, and thus were biased to sample from
axons with a high spiking probability.

A visual stimulus containing a full-field two-dimensional moving grating or bar
was generated using Psychtoolbox’””* in Matlab version 2009. The monitor screen
was positioned approximately parallel to, and 29 cm from, the right eye of the
mouse. The visual angle subtended by the monitor (±35° azimuth × ±23.5°
elevation) ensured that projective distortions were small. Both drifting sinusoidal
gratings and square wave gratings were used. For sinusoidal gratings, each presentation
of a grating started with a grey screen for 3 s, followed by the drifting
gratings (0.02 cycles per degree (CPD), 2 Hz, 3 s). In the case of square wave
gratings, each trial started with a grey screen for 3 s, followed by the drifting bars
(0.01 CPD, 0.33 Hz, 12.75 s). Each trial consisted of presenting eight gratings of
different orientations. Within a trial, the orientation of the grating was randomly 
interleaved. Five to eight trials were recorded to measure average axonal responses 
to different orientations. Simultaneously, we captured the on-screen luminance by 
recording the output of a photodiode (Thorlabs) placed at the corner of the screen. 
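
The trial structure for the sinusoidal gratings can be sketched as a randomized schedule. This is an illustrative reconstruction, not the Psychtoolbox code used in the study: it assumes the eight directions shown in Fig. 5g, one 3 s grey period before each 3 s grating, and a hypothetical seed parameter for reproducibility.

```python
import random

DIRECTIONS_DEG = [0, 45, 90, 135, 180, 225, 270, 315]


def make_schedule(n_trials, seed=None):
    """Return one randomly interleaved ordering of the directions per trial."""
    rng = random.Random(seed)
    schedule = []
    for _ in range(n_trials):
        order = DIRECTIONS_DEG[:]  # copy so the template is never mutated
        rng.shuffle(order)
        schedule.append(order)
    return schedule


def trial_duration_s(grey_s=3.0, grating_s=3.0, n_gratings=8):
    """Duration of one trial if every grating is preceded by a grey screen."""
    return n_gratings * (grey_s + grating_s)


schedule = make_schedule(n_trials=5, seed=1)
```

Under these assumptions a sinusoidal-grating trial lasts 48 s, and five to eight such trials would be averaged per varicosity.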
Analysis of time-lapse images and visually evoked fluorescent responses. 
Frames from time-lapse imaging were registered using template matching (nor- 
malized correlation coefficient) and alignment plugins coded in ImageJ (NIH; 
https://sites.google.com/site/qingzongtseng/template-matching-ij-plugin). To align 
images we first generated an average image from a small number of frames within a 
trial where there was minimal movement. A region of interest (ROI) was then 
selected within this average template image and was used to register all the frames 
within that repetition. 
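
Registration by template matching with a normalized correlation coefficient can be illustrated with a small pure-Python sketch. It is not the ImageJ plugin cited above; it assumes images are lists of rows, searches only integer shifts within a hypothetical `max_shift` window, and returns the shift at which the template best matches the frame.

```python
import math


def ncc(a, b):
    """Normalized correlation coefficient of two equal-length pixel lists."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    da = [x - mean_a for x in a]
    db = [y - mean_b for y in b]
    denom = math.sqrt(sum(x * x for x in da) * sum(y * y for y in db))
    if denom == 0:
        return 0.0  # a flat patch carries no alignment information
    return sum(x * y for x, y in zip(da, db)) / denom


def best_shift(frame, template, top, left, max_shift=2):
    """Find (dy, dx) at which the template, cut at (top, left), fits frame."""
    t_rows, t_cols = len(template), len(template[0])
    flat_template = [v for row in template for v in row]
    best = (-2.0, 0, 0)  # any real score is >= -1
    for dy in range(-max_shift, max_shift + 1):
        for dx in range(-max_shift, max_shift + 1):
            y0, x0 = top + dy, left + dx
            if y0 < 0 or x0 < 0:
                continue
            if y0 + t_rows > len(frame) or x0 + t_cols > len(frame[0]):
                continue
            patch = [frame[y0 + r][x0 + c]
                     for r in range(t_rows) for c in range(t_cols)]
            score = ncc(patch, flat_template)
            if score > best[0]:
                best = (score, dy, dx)
    return best[1], best[2]


# Template taken from the averaged image at rows 3-4, columns 3-4.
template = [[10, 5], [2, 1]]
# A frame in which the same structure has moved one pixel down and right.
frame = [[0] * 7 for _ in range(7)]
frame[4][4], frame[4][5], frame[5][4], frame[5][5] = 10, 5, 2, 1
shift = best_shift(frame, template, top=3, left=3)
```

Each frame would then be translated by the negative of the recovered shift before ROI extraction.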

To extract fluorescence signals, ROIs were drawn over GCaMP6+ varicosities
identified by using the mean intensity and standard deviation values of all the
repetitions. The pixels in each ROI were averaged to estimate fluorescence corresponding
to a single varicosity. Calcium signals were expressed as relative fluorescence
changes (ΔF/F0 = (F − F0)/F0) corresponding to the mean fluorescence
from all pixels within specified ROIs.
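
A minimal sketch of the ΔF/F0 calculation follows. The text does not state how F0 was chosen, so the sketch assumes F0 is the mean fluorescence over an initial baseline window (for instance the grey-screen period); the function names and the `n_baseline` parameter are hypothetical.

```python
def roi_mean(frame, roi_pixels):
    """Average the pixels of one ROI, given as (row, column) pairs."""
    return sum(frame[r][c] for r, c in roi_pixels) / len(roi_pixels)


def delta_f_over_f0(trace, n_baseline):
    """Return (F - F0) / F0 for each sample, with F0 from the first samples."""
    f0 = sum(trace[:n_baseline]) / n_baseline
    if f0 == 0:
        raise ValueError("baseline fluorescence is zero")
    return [(f - f0) / f0 for f in trace]


frames = [
    [[100, 100], [0, 0]],
    [[100, 100], [0, 0]],
    [[140, 160], [0, 0]],
    [[200, 200], [0, 0]],
]
roi = [(0, 0), (0, 1)]
trace = [roi_mean(frame, roi) for frame in frames]
dff = delta_f_over_f0(trace, n_baseline=2)
```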

For each experiment, stimulus temporal frequency was measured using the 
photodiode signal. Putative visually responsive axons were chosen and the calcium 
signals’ response power at the stimulus frequency (F1) and response power at twice 
stimulus frequency (F2) were measured. To gauge if the measured power was 
significant, we compared each visually evoked response power with the mean
(μgrey) and the standard deviation (σgrey) of calcium response powers to the grey
stimulus that immediately preceded each grating. Axons were deemed visually
responsive, and included for further analysis, if the F1 power of the visually evoked
calcium signal was greater than μgrey + 3σgrey for at least two of the orientations
probed. Most (4 out of 5 mice) of the calcium imaging experiments were done 
using GCaMP6S. For the experiment where we used GCaMP6F we were able to 
measure frequency doubling responses to the stimulus presentation. Axons were 
classified as ‘linear’ if at the orientation corresponding to peak F1 power, the F2 
power was smaller than the F1 power. If the F2 power was greater, the axon was 
judged as ‘nonlinear’. In experiments where we used GCaMP6F in combination
with lower temporal frequency drifting grating stimulus, we observed two
GCaMP6+ varicosities with On-Off responses (Fig. 5i and corresponding polar
plot 5l, and polar plot in Fig. 5q; and see imaging methods above). We did not
observe differences in the percentage of varicosities that were visually responsive
using GCaMP6S compared to GCaMP6F (data not shown); therefore both sets of
data are grouped in Fig. 5t, u. Power calculation was performed using the chronux
package (mtspectrumc with default tapers; see http://chronux.org/ (ref. 39)). Tuning
properties with regard to the orientation of the gratings were quantified by calculating
a selectivity index (OSI/DSI). The OSI was defined as (Rpref − Rortho)/
(Rpref + Rortho), where Rpref, the response in the preferred orientation, was the
fluorescent response with the peak F1 power. Rortho was similarly calculated
as the response evoked by the orthogonal orientation. DSI was defined as
(Rpref − Ropp)/(Rpref + Ropp), where Ropp is the response in the direction opposite
to the preferred direction. Direction tuned neurons were defined as neurons with a
selectivity index ≥ 0.4 and orientation selective neurons were defined as those with
an index ≥ 0.5.
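
The inclusion criterion and selectivity indices can be sketched as follows. This is an illustrative reading of the definitions in the text, not the authors' analysis code: it assumes one response value per direction for the eight directions of Fig. 5g, and it takes the orthogonal response at the preferred direction plus 90°, which is an assumption because the text does not say which of the two orthogonal directions was used.

```python
DIRECTIONS_DEG = [0, 45, 90, 135, 180, 225, 270, 315]


def is_visually_responsive(f1_powers, grey_mean, grey_sd, min_count=2):
    """F1 power must exceed mean + 3 s.d. of grey for at least two stimuli."""
    threshold = grey_mean + 3.0 * grey_sd
    return sum(p > threshold for p in f1_powers) >= min_count


def is_linear(f1_at_peak, f2_at_peak):
    """'Linear' if F2 power is smaller than F1 power at the peak stimulus."""
    return f2_at_peak < f1_at_peak


def selectivity_indices(responses, directions=DIRECTIONS_DEG):
    """Return (OSI, DSI) from one response per stimulus direction."""
    by_direction = dict(zip(directions, responses))
    pref = max(by_direction, key=by_direction.get)
    r_pref = by_direction[pref]
    r_ortho = by_direction[(pref + 90) % 360]  # assumed orthogonal direction
    r_opp = by_direction[(pref + 180) % 360]
    osi = (r_pref - r_ortho) / (r_pref + r_ortho)
    dsi = (r_pref - r_opp) / (r_pref + r_opp)
    return osi, dsi


osi, dsi = selectivity_indices([1.0, 0.6, 0.2, 0.3, 0.5, 0.3, 0.2, 0.6])
orientation_tuned = osi >= 0.5
direction_tuned = dsi >= 0.4
```

In this made-up example the varicosity would count as orientation-tuned but not direction-tuned under the stated thresholds.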

Extended Data Figure 1 | The retino-geniculo-cortical pathway links retinal
cells and circuits to the brain. a, Diagram of retina, dorsal lateral geniculate
nucleus (dLGN) and primary visual cortex (V1). The optic tract which
carries retinal ganglion cell (RGC) axons and thalamocortical (dLGN to V1)
pathway also shown. b, Diagram of retinal layers: PRL, photoreceptor layer;
opl, outer plexiform layer; INL, inner nuclear layer; ipl, inner plexiform layer;
GCL, ganglion cell layer; nfl, nerve fibre layer. c, Retina diagram with cells
shown (labels same as in b).




[Extended Data Figure 2, panels b-d (graphics not reproduced): cortical layers L3-L6 against depth (axis ticks 200-800); Max. fluorescence (%), 0-100; retrolabelled dLGN neurons from OT (0%) to medial border (100%).] 
Extended Data Figure 2 | Approach for assessing laminar specificity of 
mouse geniculocortical projections. a, Focal retrograde tracer injection to V1. 
Scale bar, 3 mm. b, Diagram of the three different injection depths used to 
generate data in Fig. 2. c, Percentage of fluorescence in V1 from superficial 
(black line) versus deep (grey line) injections. Superficial, peak intensity 
occurs at 25 µm from pial surface (4 mice). Deep, peak intensity occurs 
at 350 µm from pial surface. Grey shaded regions, s.e.m. (superficial vs 
deep = ***P < 0.0001; two-way ANOVA). d, Assessment of retrogradely 
labelled cells across the width of the dLGN. 0% is at optic tract, 100% is at 
medial border (see Fig. 2g-i). 
Extended Data Figure 3 | Retrograde tracers to superficial V1 label cells in 
the DSGC-RZ. a-c, Same dLGN as in main Fig. 2f but with GFP⁺ On-Off 
DSGC axons shown. a, Most of the retrogradely labelled cells (magenta/dashed 
circles) reside in the DSGC-RZ (green terminals). Asterisk, labelled cell outside 
the DSGC-RZ. Scale bar, 200 µm. b, c, High magnification views of retrogradely 
labelled dLGN neuron cell bodies with potential contact from GFP⁺ DSGC 
axons (arrow in b); c, this cell is in vicinity of DSGC axonal boutons 
(arrowheads). b, c, Scale, 15 µm. d, Diagram of laminar-specific connections 
between DSGC-RZ and superficial V1 and dLGN core and deeper V1 
layers 4 and 6. 
Extended Data Figure 4 | Analysis of dLGN neurons retrogradely infected 
from superficial V1. a-f, Example serial sections of anterior, middle and 
posterior portions of dLGN in a mouse with GFP expressing On-Off DSGC 
axons that was injected with ΔG-RABV-mCherry in superficial layers of V1. 
a, DAPI to show cytoarchitectural landmarks and dLGN borders. b, GFP⁺ 
DSGC axons and AAV2-Glyco-hGFP-infected cell bodies (see main Fig. 4 and 
text). c, Mask of GFP⁺ DSGC axons (Methods). d, ΔG-RABV-mCherry⁺ 
dLGN relay neurons. e, GFP⁺ DSGC axon mask superimposed with mCherry 
signal; this was used to determine colocalization. f, mCherry and GFP signals 
merged. Scale bar, 200 µm. 
Extended Data Figure 5 | Putative sites of contact between DSGC axons and 
a dLGN neuron retrogradely infected from superficial V1. a-i, GFP⁺ 
On-Off DSGC axons (green in all panels except black in b) and mCherry⁺ 
dLGN relay neuron (magenta in all panels except white in c) infected by 
injection to superficial V1. Framed region in a is shown at higher magnification 
in b-d. Arrowhead (a), thalamocortical axon of mCherry⁺ dLGN cell. Scale bar 
in a, 50 µm. Yellow boxed region in c, d, is shown at higher magnification in 
e-i. Scale bar in d, 15 µm. e-i, Some DSGC axon-dendrite contacts contain 
VGLUT2 (blue). f-i, Arrowhead, site of GFP/mCherry co-localization that 
does not contain VGLUT2; arrow, GFP/mCherry/VGLUT2⁺ contact. 
Extended Data Figure 6 | The axons of GFP⁺ On-Off DSGCs and dLGN 
neurons infected with AAV2-Glyco-hGFP can be distinguished on the basis 
of their cellular localization. High magnification view of DSGC-RZ in mouse 
with GFP⁺ posterior-tuned On-Off DSGCs that was injected 14 days earlier 
with AAV2-Glyco-hGFP. Glyco-hGFP⁺ neurons have nuclear GFP labelling 
(arrows), whereas DSGCs have GFP in axon terminals (arrowheads). 
Dashed line, lateral border of dLGN. OT, optic tract. Scale bar, 50 µm. 
Extended Data Figure 7 | Signature anatomical and physiological 
characteristics of GFP-tagged On-Off DSGCs. a, b, Flat-mount retina with 
GFP⁺ On-Off DSGCs (a) and co-stained with DAPI (b). c, Positions of GFP⁺ 
RGCs. Scale bar in c, 150 µm. d-f, High magnification views. Scale bar, 12 µm. 
g, Targeted fill of a GFP⁺ DSGC. Scale bar, 50 µm. h, Schematic of On-Off 
DSGC stratification and starburst amacrine cells (magenta). Labelling as in 
Extended Data Fig. 1. i, j, Higher magnification of framed region in g stained for 
VAChT (starburst amacrine processes). Asterisk, ‘looping arborizations’; 
dashed line, GFP arborization, which matches VAChT plexus. Scale bar, 10 µm. 
k, l, Side (x-z plane) views of cell in g. GFP⁺ dendrites co-stratify with both the 
On and Off sublayers. Scale bar, 5 µm. m, Direction-tuned response of a GFP⁺ 
On-Off DSGC targeted for recording and receptive field characterization. The 
spike count is highest for bars moving towards ~270° in the cardinal axes. 
Extended Data Figure 8 | Injections of ΔG-RABV-mCherry into both 
superficial and deep V1 combined with AAV2-Glyco-hGFP infection of 
dLGN core. a, mCherry⁺ neurons in the DSGC-RZ and the core of the dLGN. 
b, AAV2-Glyco-hGFP: many neurons throughout the dLGN, but mostly along 
the medial border and not in the shell/DSGC-RZ express Glyco-hGFP. 
DSGC-RZ marked by axons of GFP⁺ On-Off DSGCs. c, Merged of a, b. Scale in 
a, 100 µm. Boxed regions with arrows: two dLGN neurons; both RABV-mCherry⁺ 
and AAV2-Glyco-hGFP⁺. One or both of these cells infected their 
presynaptic partner, the RGC shown in Fig. 4 (panels cc-ee) of the main text. 
Scale bar, 15 µm. 
Intestinal crypt homeostasis revealed at 
single-stem-cell level by in vivo live imaging 

Laila Ritsma, Saskia I. J. Ellenbroek, Anoek Zomer, Hugo J. Snippert, Frederic J. de Sauvage, Benjamin D. Simons, 
Hans Clevers & Jacco van Rheenen 


The rapid turnover of the mammalian intestinal epithelium is 
supported by stem cells located around the base of the crypt. In 
addition to the Lgr5 marker, intestinal stem cells have been associated 
with other markers that are expressed heterogeneously within 
the crypt base region. Previous quantitative clonal fate analyses 
have led to the proposal that homeostasis occurs as the consequence 
of neutral competition between dividing stem cells. However, the 
short-term behaviour of individual Lgr5⁺ cells positioned at different 
locations within the crypt base compartment has not been resolved. 
Here we establish the short-term dynamics of intestinal stem cells 
using the novel approach of continuous intravital imaging of 
Lgr5-Confetti mice. We find that Lgr5⁺ cells in the upper part of the niche 
(termed ‘border cells’) can be passively displaced into the 
transit-amplifying domain, after the division of proximate cells, implying 
that the determination of stem-cell fate can be uncoupled from division. 
Through quantitative analysis of individual clonal lineages, we 
show that stem cells at the crypt base, termed ‘central cells’, experience 
a survival advantage over border stem cells. However, through 
the transfer of stem cells between the border and central regions, all 
Lgr5⁺ cells are endowed with long-term self-renewal potential. These 
findings establish a novel paradigm for stem-cell maintenance in 
which a dynamically heterogeneous cell population is able to function 
long term as a single stem-cell pool. 

In the small intestine, stem cells are associated with Lgr5 expression, 
which marks around 14-16 proliferative crypt base columnar (CBC) 
cells distributed throughout the crypt base. The stem-cell niche is constituted 
by Paneth cells and the surrounding mesenchyme. Cells 
that become displaced from this region enter the transit-amplifying 
domain (TA) and lose stemness. Quiescent or slow-cycling cells, positioned 
at or near the +4 position may constitute a second stem-cell 
type, although a recent study indicated that some, if not all, of 
these cells represent secretory precursors that, in common with Dll1⁺ 
cells higher in the crypt, can be recruited back into the stem-cell compartment 
upon damage. The hierarchy, heterogeneity and spatial 
organization of intestinal stem cells remain subjects of debate. It is 
unclear whether stem and progenitor cells are organized in an engrained 
proliferative hierarchy, defined by the signature of molecular markers, 
or whether stem cells transit reversibly between states of variable com- 
petence in which they become biased towards renewal or differenti- 
ation. If the latter is true, then it remains to be determined whether that 
bias is controlled by intrinsic heterogeneity in the expression of fate 
determinants, or is the consequence of spatiotemporal cues associated 
with niche-derived signals. Although inducible genetic lineage tracing 
allows us to dissect short-term heterogeneity in self-renewal potential, 
its reliability may be undermined by transient effects due to drug-inducing 
agents, Cre activity, or non-representativeness of labelling. Therefore 
we applied an in vivo live-imaging strategy, allowing measurements to 
begin several days after drug administration. In common with previous 
live-imaging approaches used to study stem cells in hair follicle and 
testis, our approach enables tracing of the fate of individual marked 
stem cells and their progeny over time in vivo. 

Multiphoton intravital microscopy and surgical implantation of an 
abdominal imaging window (AIW) into living Lgr5EGFP-Ires-CreERT2/ 
R26R-Confetti mice were used to obtain visual access to the intestinal 
stem-cell niche (Fig. 1a). Lgr5⁺ CBC cells and their progeny were lineage 
traced over time (Extended Data Fig. 1) by activating the expression of 
one of the Confetti colours (membranous cyan fluorescent protein (CFP), 
cytoplasmic yellow fluorescent protein (YFP) and red fluorescent protein 
(RFP)) in individual Lgr5⁺ cells using tamoxifen-mediated recombination 
of the Confetti construct (Fig. 1a). To characterize the fate 
behaviour of CBC cells, we followed lineages of 80 marked cells (n = 4 
mice) up to 5 days from the start of time-lapse imaging (Extended Data 
Fig. 2; for controls, see ref. 27 and Extended Data Fig. 3). 

After induction, clonal progeny were observed throughout the 
stem-cell niche. To quantify the fate behaviour of Lgr5⁺ CBC cells, 
we acquired z-stacks (Fig. 1b; see Supplementary Video 1 for the three- 
dimensional reconstruction) and classified cells based on their relative 
position, using the most basal cells (row 0) as a reference (Fig. 1b). 
Confetti-labelled clones were scored according to cell number, disag- 
gregated by position (Extended Data Fig. 4). In line with predictions of 
neutral competition, numbers of marked cells in the stem-cell niche 
varied widely between clones (some expanded in size, others lost attach- 
ment to this compartment altogether; Extended Data Figs 2, 4). As just 
1 of the 28 clones containing a single marked CBC cell at the start of 
filming remained single after 2 days of tracing, we chose to neglect the 
potential impact of lineage-committed quiescent Lgr5⁺ cells, which 
were identified previously. 

To investigate spatial heterogeneity in the self-renewal potential of 
CBC cells, we defined two regions within the Lgr5⁺ stem-cell niche: a 
central (rows 0 to +2) and border (+3 and +4) region (Fig. 1b). A 
‘mother’ cell in either the central or the border region could expand 
and give rise to progeny that extended into both regions (Fig. 1c-f and 
Extended Data Fig. 5). Further quantitative analysis was necessary to 
address the potency of CBC cells in these two domains. Although the 
average number of central cells per clone derived from a single central 
mother cell remained approximately constant, consistent with their main- 
tenance over time, the average number of border cells derived from 
these cells increased to approximately two by day 3 (Fig. 2a). Further- 
more, maintenance of the average central cell number was achieved 
through the steady decline in the number of clones retaining at least 
one central cell (Fig. 2b), compensated by a steady increase in size of 
those that remained (Fig. 2c). Although clones derived from single bor- 
der mother cells also appeared to approximately maintain their number, 
they gave rise to a comparatively smaller number of central cells (Fig. 2a). 
The sustained increase in the number of border cells from a central 
Figure 1 | Intravital lineage tracing of Lgr5⁺ cells. a, Cartoon showing a 
Lgr5EGFP-Ires-CreERT2/R26R-Confetti mouse with an AIW to visualize intestinal 
Lgr5⁺ CBC cells and their Confetti progeny over multiple imaging sessions. 
b, Lateral projection of a z-stack and representative xy images of a crypt at 
indicated z-stack positions (pos.). The stem-cell (SC) niche (z0-13) is defined 
by Lgr5-GFP fluorescence. The relative position of CBC cells to the most basal 
cell (row 0) determines location in the central (row 0 to +2, which translates 
to z0-6) or border region (row +3 to +4, which translates to z7-13) of the 
stem-cell niche. a.u., arbitrary units. Scale bar, 20 µm. c-f, Intravital lineage 
tracing of RFP-expressing Lgr5⁺ CBC cells located at the centre (c, d) and 
border (e, f) regions. Grey lines indicate crypts, white lines indicate Confetti 
clones. d, f, Graphs show time evolution of spatial organization of Confetti 
clones starting 3 days after induction. Clone size is divided into central 
(light green) and border (dark green) CBC cells. Asterisk indicates clones in 
which all progeny were lost. Scale bar, 20 µm. 


mother cell (Fig. 2a) indicates that these cells typically outcompete cells 
at the niche border (Fig. 2d). 
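The three clone statistics used in this argument (average clone size, fraction of surviving clones and average size of surviving clones) are linked by a simple identity: the average over all clones equals the surviving fraction multiplied by the average size of the survivors. A minimal Python sketch, using hypothetical clone sizes rather than the measured data and an illustrative function name, makes the relationship explicit.

```python
from typing import List, Tuple


def clone_summary(sizes: List[int]) -> Tuple[float, float, float]:
    """Summarize clone sizes in one niche region at one time point.

    sizes holds the number of marked cells per clone (0 = clone lost
    from this region). Returns (mean size over all clones, fraction of
    surviving clones, mean size of surviving clones).
    """
    if not sizes:
        raise ValueError("at least one clone is required")
    surviving = [s for s in sizes if s > 0]
    mean_all = sum(sizes) / len(sizes)
    surviving_fraction = len(surviving) / len(sizes)
    mean_surviving = sum(surviving) / len(surviving) if surviving else 0.0
    return mean_all, surviving_fraction, mean_surviving


# Hypothetical example: four clones that each started as one central cell.
# Two have been lost from the centre and two have grown to two cells.
mean_all, frac, mean_surv = clone_summary([0, 0, 2, 2])
```

In this made-up example the average stays at one cell per clone even though half of the clones are lost, because the survivors have doubled in size; this is the pattern described for central cells in Fig. 2a-c.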

To investigate the potential basis of this positional advantage, we 
studied the development of clones with finer time resolution. Every 2 
hours, we acquired multiphoton images of crypts, followed the loca- 
tion of all green fluorescent protein (GFP)-labelled cells over time 
(Extended Data Fig. 6 and Supplementary Videos 2-4), and found 
that division of single Lgr5⁺ cells coincides with displacement of proximate 
CBC cells. This suggests that cell proliferation creates competition 
for space, leading to an adjustment of cell positions. Through this 
rearrangement, and independent of their division history, CBC cells 
located at the border can become passively displaced from the niche 
after division of a neighbour (Fig. 2e and Extended Data Fig. 4). 

To challenge this conclusion and address the potency of the Lgr5⁺ 
CBC stem-cell population, we aimed to capture quantitatively the vari- 
ability seen in the lineage potential of individual cells by a biophysical 
model, involving a revision of the neutral drift dynamics model introduced 
Figure 2 | Central CBC cells experience a short-term positional advantage 
in self-renewal potential. a-c, Clonal evolution of a Confetti cell located at the 
central or border region starting 3 days after induction. Graphs show average 
clone size (a), fraction of ‘surviving’ clones that contain at least one marked 
central (top) or border (bottom) cell (b), and average size of surviving clones 
(clones with at least one marked cell) (c). Different colours indicate different 
regions in the niche. Points show data and lines show fit to the biophysical 
model (see Fig. 3). Error bars represent standard deviation (s.d.). d, Fold 
increase in clone size over 3 days from a border or central Confetti⁺ CBC cell. 
Note that central stem cells have a positional advantage over border stem cells. 
Error bars represent standard error of the mean (s.e.m.); P < 0.001 obtained 
using a Mann-Whitney U test. e, Intravital images of the same crypt at 
indicated times. Note that the yellow cell is truly expelled from the stem-cell 
(SC) niche, as GFP expression was absent in the TA cell region (see charts at 
indicated time points). Scale bars, 20 µm. 


previously, in which all stem cells were considered functionally equivalent. 
In this new model, a periodic quasi-one-dimensional arrangement 
of stem cells mimicked the ‘collar-like’ geometry of the central 
and border niche regions of the crypt (Fig. 3a). To account for the mixed 
GFP expression profile seen at rows +3 and +4 (Fig. 1b), the border 
region was further subdivided into Lgr5⁺ CBC cells and Lgr5⁻ TA 
cells. To accommodate the range of observed dynamical behaviours in 
the stem-cell niche, we allowed for five possible ‘channels’ of stem-cell 
loss and replacement (Fig. 3a). That is, after division of a border stem 
cell, one daughter cell remains at its position while the other either 
(1) displaces a border TA cell out of the niche; (2) displaces a border 
stem cell that in turn displaces a border TA cell out of the niche; or 
(3) displaces a central stem cell that in turn displaces a border stem cell 
into the border TA cell domain. Similarly, after division of a central 
stem cell, one daughter remains at its position while the other either 
(4) displaces a border stem cell into the border TA cell region; or (5) dis- 
places a central cell that in turn displaces a border stem cell into the TA 
cell region. If we define $\lambda$ as the rate of transfer of border TA cells out of 
the niche, each of these five processes occurs at rates $P_{b}\lambda$, $P_{bb}\lambda$, $P_{bc}\lambda$, 
$P_{cb}\lambda$ and $P_{cc}\lambda$, respectively, with $P_{b} + P_{bb} + P_{bc} + P_{cb} + P_{cc} = 1$ 
(Supplementary Notes). 
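The bookkeeping of the five channels can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the authors' model code: the channel probabilities below are placeholder values (the fitted values are in the Supplementary Notes and are not reproduced here), the function names are invented, and the events are treated as one aggregate Poisson process rather than being resolved cell by cell on the quasi-one-dimensional lattice.

```python
import random
from typing import Dict, List, Tuple

# Placeholder channel probabilities (hypothetical, NOT fitted values).
CHANNELS: Dict[str, float] = {
    "b": 0.2,   # (1) border division displaces a border TA cell
    "bb": 0.2,  # (2) border -> border stem cell -> border TA cell
    "bc": 0.2,  # (3) border -> central stem cell -> border stem cell
    "cb": 0.2,  # (4) central division displaces a border stem cell
    "cc": 0.2,  # (5) central -> central cell -> border stem cell
}


def channel_rates(lam: float, probs: Dict[str, float]) -> Dict[str, float]:
    """Return the rate of each channel, P_i * lambda, after checking sum(P_i) = 1."""
    if abs(sum(probs.values()) - 1.0) > 1e-9:
        raise ValueError("channel probabilities must sum to 1")
    return {name: p * lam for name, p in probs.items()}


def sample_events(n: int, lam: float, probs: Dict[str, float],
                  seed: int = 0) -> List[Tuple[float, str]]:
    """Draw n (time, channel) events from the competing Poisson channels.

    Because the probabilities sum to 1, the total event rate is lam and
    each event belongs to channel i with probability P_i.
    """
    rng = random.Random(seed)
    names = list(probs)
    weights = [probs[name] for name in names]
    t = 0.0
    events = []
    for _ in range(n):
        t += rng.expovariate(lam)  # exponential waiting time at the total rate
        events.append((t, rng.choices(names, weights=weights)[0]))
    return events


rates = channel_rates(2.0, CHANNELS)
events = sample_events(10, 2.0, CHANNELS)
```

Each event removes exactly one cell from the border, so the total loss rate is $\lambda$ whichever channel fires; the probabilities only decide which rearrangement accompanies it.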

By fixing the relative rates of stem-cell division and displacement by 
the observed average clone size dependences and independent estimates 
Figure 3 | Biophysical model of intestinal stem-cell dynamics. a, From the 
unfolded crypt caricature (left), we synthesize a quasi-one-dimensional 
biophysical model of the niche region (right) consisting of two domains: border 
and centre. To conserve cell number, cell rearrangements after stem-cell 
division displace precisely one cell from the border. To capture the range of 
lineage data, we include five channels of stem-cell loss/replacement (1-5), as 
defined in the main text. b, Cumulative size distributions of clones derived from 
a single cell in the centre (left) or border (right). Clone size is defined in both 
cases by total number of constituent cells in the centre and border regions. 
Error bars represent s.e.m. Points represent predictions of the model using 
the same parameters as those inferred from the average dependences 
(Supplementary Notes). 


of the average cell division rate, we found that the biophysical model 
can accurately predict the clone size distribution and spatial dependencies 
observed in live imaging (Figs 2a-c, 3b and Extended Data Fig. 7). 
More significantly, with the same parameters, the model describes 
quantitatively convergence onto the hallmark scaling behaviour reported 
using static lineage tracing assays at intermediate times (7 and 14 days 
after induction), as well as the predicted progression towards crypt 
monoclonality at long times (Extended Data Figs 8, 9 and Supplementary 
Notes). 

To challenge the model further, we traced the recovery of stem cells 
after targeted ablation of Lgr5⁺ cells using diphtheria toxin (DT) 
injection in mice in which the human DT receptor (DTR) fused to 
enhanced (e)GFP was knocked in at the Lgr5 locus (Lgr5DTR:EGFP) 
(Fig. 4a). In these mice, recovered Lgr5⁺ cells are derived from a TA 
lineage. After complete depletion (Fig. 4a), we observed a low frequency 
of initiation and a heterogeneous pattern of recovery (Fig. 4b 
and Supplementary Video 5), suggesting sporadic transfer of cells from 
the TA zone into the stem-cell niche border. The cohesion of these 
recovered cell clusters (Supplementary Video 6) suggests clonal expan- 
sion of individual TA cells. Intriguingly, by allowing individual border 
stem cells to recolonize a depleted stem-cell niche through cell division 
uncompensated by loss, our biophysical model provided a quantitative 
prediction of cluster composition (border versus central) by size, with 
the same relative rates of stem-cell division as those found in the steady 
state (Fig. 4c and Supplementary Notes). 

Our data show that intestinal stem-cell maintenance follows from 
competition between proximate CBC stem cells for limited niche access, 
and that stem cells positioned near the niche boundary experience a 
bias towards loss and replacement, whereas stem cells remote from the 
boundary are biased towards survival. Intriguingly, a similar depend- 
ence of self-renewal potential on proximity to the niche border was 
reported in a recent in vivo live-imaging study of the mouse hair 
follicle, suggesting that such heterogeneity may be a ubiquitous feature 
of adult stem-cell populations. A recent lineage tracing study based 
on the continuous and sporadic acquisition of mutations during DNA 
replication concluded that only a subfraction of putative intestinal 
stem cells are ‘functional’. Our quantitative analysis of live-imaging 
data shows that central stem cells are about three times more likely 
than border cells to colonize fully a crypt in steady state, explaining why 
only a fraction of Lgr5⁺ cells seem to retain long-term self-renewal 


Figure 4 | Recovery of stem-cell compartment after ablation of Lgr5⁺ cells 
challenges model. a, Targeted ablation of Lgr5⁺ cells in Lgr5DTR:EGFP mice 
was induced by injection of DT. Shown are representative images pre- and 
post-ablation. Scale bars, 20 µm. b, Recovery of Lgr5⁺ CBC cells was 
monitored only in mice in which full depletion was confirmed 24 h after 
DT injection. Images taken at 72 h after depletion show representative 
crypts containing clonal clusters of different sizes (n = 108 crypts in 
3 mice). Scale bars, 20 µm. c, For all various clone sizes, measured spatial 
composition (border versus centre) of Lgr5⁺ CBC cells in clusters (grey) 
was accurately predicted by the biophysical model (black). 
Error bars represent s.d. 




potential (Supplementary Notes). Through the transfer of cells between 
the central and border regions of the niche, the dynamic and hetero- 
geneous population of intestinal stem cells is able to function long term 
as a single equipotent pool. 


METHODS SUMMARY 


All experiments were carried out in accordance with the guidelines of the Animal 
Welfare Committee of the Royal Netherlands Academy of Arts and Sciences. 
Lgr5EGFP-Ires-CreERT2/R26R-Confetti mice between 10 and 22 weeks old were used 
for experiments. Three days before imaging, mice were injected with 2.5-5 mg 
tamoxifen (Sigma Aldrich) to activate expression of Confetti colours. Lgr5DTR:EGFP 
mice with an AIW received 50 µg kg⁻¹ DT through intraperitoneal injections. 
Intravital imaging was performed on an inverted Leica TCS SP5 AOBS two-photon 
microscope with a chameleon Ti:Sapphire pumped optical parametric oscillator 
(Coherent) equipped with a ×25 (HCX IRAPO NA0.95 WD 2.5 mm) water objective 
and four non-descanned detectors (NDDs). Confetti colours were detected 
using sequential scanning: excitation 860 nm, NDD2 455-490 nm emission (CFP 
and eGFP); excitation 960 nm, NDD3 500-550 nm emission (eGFP and YFP), 
NDD4 560-650 nm emission (YFP and RFP). Second harmonic generation (SHG) 
signal was generated by 960 nm excitation at collagen I and detected in NDD2. z-stacks 
with 2.5 µm z-steps of typically 70-80 images were acquired. Re-identification of 
the same crypts over multiple days was accomplished by storing the xy coordinates 
of the imaged regions using the ‘multiple position’ function in the LAS-AF software 
and using the vasculature and the typical Lgr5⁺ crypt pattern as visual landmarks. 
Acquired images were processed using a custom-designed Visual Basic 
program and further processed with ImageJ (National Institutes of Health). 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS 

Mice. All experiments were carried out in accordance with the guidelines of the 
Animal Welfare Committee of the Royal Netherlands Academy of Arts and 
Sciences. To obtain Lgr5EGFP-Ires-CreERT2/R26R-Confetti mice, R26R-Confetti 
mice were crossed with Lgr5EGFP-Ires-CreERT2 mice. Random double heterozygous 
male mice between 10 and 22 weeks old were used for experiments. Three days 
before imaging, mice were injected with 2.5-5 mg tamoxifen (single injection; 
Sigma Aldrich) to induce activation of Cre recombinase, which in turn induced 
expression of one of the Confetti colours (membranous CFP, cytoplasmic YFP and 
RFP). Nuclear GFP was also activated, but that subset of Confetti-labelled cells was 
not followed. For the targeted ablation studies, four male Lgr5DTR:EGFP mice with 
an AIW received 50 µg kg⁻¹ DT through intraperitoneal injections. Depletion of 
Lgr5⁺ cells was confirmed by intravital imaging. Mice in which Lgr5⁺ cells were 
not completely depleted after 24h received a second DT injection. Mice were 
housed under standard laboratory conditions and received food and water ad 
libitum. 

AIW surgery. The AIW surgery was performed as described previously. In short, 
all surgical procedures were performed under 2% isoflurane (v/v) inhalation anaesthesia. 
Before surgery, buprenorphine (3 µg per mouse; Temgesic, BD Pharmaceutical 
Systems) was administered intramuscularly. The left lateral flank of the mice was 
shaved and the skin was disinfected with 70% (v/v) ethanol. Next, a left lateral flank 
incision was made through the skin and abdominal wall and a purse string suture 
was placed along the wound edge. A disinfected AIW (>1h in 70% (v/v) ethanol) 
was placed glass-side down next to the mice and the ileum was placed on top. 3M 
Vetbond Tissue Adhesive (n-butyl cyanoacrylate) was used to fix the ileum to the 
cover glass of the AIW and CyGel (BioStatus Limited) was added to diminish 
peristaltic movement. After 5 min the AIW was inverted and placed in the mouse, 
with the skin and abdominal wall placed inside the AIW groove. Then the sutures 
were tightened to stably secure the window into the animal. After surgery the mice 
were provided with food and water ad libitum. Furthermore, mice were closely 
monitored once a day before imaging for behaviour, reactivity, appearance and 
defecation. 

Equipment and settings. Intravital imaging was performed on an inverted Leica 
TCS SP5 AOBS two-photon microscope with a chameleon Ti:Sapphire pumped 
optical parametric oscillator (Coherent) equipped with a ×25 (HCX IRAPO 
NA0.95 WD 2.5 mm) water objective and four non-descanned detectors (NDDs). 
The NDDs collect the following wavelengths: NDD1, <455 nm; NDD2, 455-490 nm; 
NDD3, 500-550 nm; NDD4, 560-650 nm. Sequential scanning was performed, 
exciting the tissue with 860 and 960 nm wavelengths. The Confetti colours 
were detected as follows: 860 nm, NDD2 (CFP and eGFP); 960 nm, NDD3 (eGFP 
and YFP), NDD4 (YFP and RFP). Second harmonic generation (SHG) signal was 
generated by 960 nm excitation at collagen I and detected in NDD2. Scanning 
was performed in a bidirectional mode at 700 Hz and 12 bit, with a zoom of 1.7, 
512 × 512 pixels. z-stacks with 2.5 µm z-steps of typically 70-80 images were 
acquired. Re-identification of the same crypts over multiple days was accomp- 
lished by storing the xy coordinates of the imaged regions using the ‘multiple 
position’ function in the LAS-AF software and using the vasculature and the typical 
(Confetti) Lgr5⁺ crypt pattern as visual landmarks. 
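The detector assignments listed above imply that each fluorophore has a distinct pattern across the three recorded channels. The following Python sketch shows one way such a pattern could be turned into a colour label. The thresholding rule, the function name and the threshold value are assumptions for illustration only; the text does not state how colours were assigned, and the SHG signal in NDD2 at 960 nm is ignored here.

```python
# Expected presence of signal in (NDD2 at 860 nm, NDD3 at 960 nm, NDD4 at 960 nm),
# taken from the detector assignments in the text.
SIGNATURES = {
    (True, False, False): "CFP",   # NDD2 only
    (True, True, False): "eGFP",   # NDD2 and NDD3
    (False, True, True): "YFP",    # NDD3 and NDD4
    (False, False, True): "RFP",   # NDD4 only
}


def classify_colour(ndd2_860: float, ndd3_960: float, ndd4_960: float,
                    threshold: float = 0.5) -> str:
    """Label a cell from normalized channel intensities (0 to 1).

    The fixed threshold is a hypothetical choice; real data would need
    background subtraction and bleed-through correction first.
    """
    key = (ndd2_860 > threshold, ndd3_960 > threshold, ndd4_960 > threshold)
    return SIGNATURES.get(key, "unassigned")


labels = [
    classify_colour(0.9, 0.1, 0.1),
    classify_colour(0.8, 0.7, 0.1),
    classify_colour(0.1, 0.9, 0.8),
    classify_colour(0.0, 0.2, 0.9),
]
```

Patterns that match none of the four signatures are returned as "unassigned" rather than forced into a class.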

Multi-day intestinal stem-cell imaging. After placing the AIW, mice were kept 
under anaesthesia and placed face-down in a custom-designed imaging box in 
which isoflurane (1% (v/v)) was administered through a facemask as described 
previously. For the multi-day imaging sessions (Figs 1c, d, 2, 3b, 4 and Extended 
Data Figs 1, 2, 4, 5, 7), mice were imaged once a day for a maximum of 3 h, during 
which the climate chamber surrounding the microscope was kept at 32 °C. After 
the imaging session the mice were allowed to wake up to maintain their body 
temperature. After imaging, acquired z-stacks were corrected for z and xy shifts 
using a custom-designed Visual Basic software program and further processed and 
analysed using basic functions in ImageJ software (linear contrasting, blurring, 
median filtering). 

Short-term intestinal stem-cell imaging. Mice were anaesthetized using isoflur- 
ane (2% (v/v)). The left lateral flank was shaved and the skin was disinfected using 
70% (v/v) ethanol. Next, a left lateral flank incision was made through the skin and 
abdominal wall and the ileum was exteriorized using PBS-soaked cotton 
swabs. The ileum was placed on a custom-designed inset containing a coverslip 
fitting the custom-designed imaging box. The ileum was secured to the coverslip 
using Vetbond and CyGel. The mouse was placed on top of the intestine and PBS-soaked 
sterile cotton gauzes were placed next to the animal to prevent dehydration. 
Parafilm M (Sigma Aldrich) was used to cover the mouse and a subcutaneous 
infusion system was used to provide 100 µl of sterile PBS per hour. The inset was 
placed within the custom-designed imaging box in which isoflurane (1% (v/v)) was 
administered through a facemask as described earlier. The temperature of the 
mouse was monitored during imaging using a rectal probe and was kept between 
36 and 37 °C by adjusting the temperature of the surrounding climate chamber. 
Imaging was performed every 2 h for 14 h. z-stacks with a z-step of 2.5 µm of 12 
regions with on average 6 crypts were made. Acquired z-stacks were analysed using 
ImageJ plugins (TurboReg, 3D visualization, 3D viewer). 

Intravital imaging of Lgr5⁺-cell-depleted mice. Mice with an AIW received 
50 µg kg⁻¹ DT through intraperitoneal injections. Depletion of Lgr5⁺ cells was 
confirmed by intravital imaging. Mice in which Lgr5⁺ cells were not completely 
depleted after 24 h received a second DT injection. Only mice in which full depletion 
was confirmed by intravital imaging 24 h after the last DT injection were 
analysed. The number of Lgr5⁺-GFP cells within the stem-cell niche border and 
centre was determined as described in Fig. 1b. 

Real-time lineage tracing of clonal competition. The data from the lineage 
tracing were collected at random, and all clones that were imaged with a 3-day 
interval were included. The strength of a Confetti-labelled Lgr5⁺ CBC cell to 
produce offspring was expressed as the fold increase in Confetti-labelled Lgr5⁺ 
CBC cell number 3 days after the first imaging session. A Mann-Whitney U test 
was performed because the data were not normally distributed. 
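The fold-increase measure and the rank-based comparison can be sketched as follows. This is an illustrative pure-Python version with hypothetical numbers, not the authors' analysis: it computes only the Mann-Whitney U statistic by direct pair counting, and a P value would in practice be obtained from a statistics package (for example `scipy.stats.mannwhitneyu`).

```python
from typing import Sequence


def fold_increase(n_start: int, n_end: int) -> float:
    """Fold change in labelled CBC cell number between two imaging sessions."""
    if n_start <= 0:
        raise ValueError("a clone must contain at least one cell at the start")
    return n_end / n_start


def mann_whitney_u(x: Sequence[float], y: Sequence[float]) -> float:
    """U statistic for sample x: count of pairs with x_i > y_j, ties counted as 0.5."""
    u = 0.0
    for xi in x:
        for yj in y:
            if xi > yj:
                u += 1.0
            elif xi == yj:
                u += 0.5
    return u


# Hypothetical fold increases over 3 days (not data from the paper).
central = [fold_increase(1, 3), fold_increase(1, 4), fold_increase(1, 5)]
border = [fold_increase(1, 1), fold_increase(1, 2), fold_increase(1, 3)]
u_central = mann_whitney_u(central, border)
u_border = mann_whitney_u(border, central)
```

The two U values always sum to the number of pairs (here 3 × 3 = 9), which gives a quick consistency check on the counting.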

Quantitative data analysis of multi-day lineage tracing. Lineage tracing was 
performed for 80 clones in 80 crypts from 4 mice. No sample size estimate was 
calculated before the study was executed. Only data from mice from which high 
enough quality images were acquired were included in the study. The number of 
Confetti-labelled cells per crypt position (centre (rows 0 to +2), border (rows +3 
to +4), TA (rows >4)) was scored. From the 80 lineages, we obtained 33 sub- 
lineages originating from the central region (Fig. 4a), and 47 sublineages from the 
border (Fig. 4b). 
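The positional scoring described above amounts to a simple mapping from row number to niche region. A minimal Python sketch follows; the function names and the example rows are illustrative, and only the row boundaries (centre, rows 0 to +2; border, rows +3 to +4; TA, rows >4) come from the text.

```python
from typing import Dict, Iterable


def region_for_row(row: int) -> str:
    """Map a cell row (0 = most basal cell) to its crypt region."""
    if row < 0:
        raise ValueError("row numbers start at 0")
    if row <= 2:
        return "centre"
    if row <= 4:
        return "border"
    return "TA"


def count_by_region(rows: Iterable[int]) -> Dict[str, int]:
    """Count the labelled cells of one clone in each region."""
    counts = {"centre": 0, "border": 0, "TA": 0}
    for row in rows:
        counts[region_for_row(row)] += 1
    return counts


# Hypothetical clone with labelled cells at these rows.
clone_counts = count_by_region([0, 1, 3, 4, 5, 2])
```

Scoring every clone this way at each imaging session yields the per-region cell numbers from which average clone sizes and surviving fractions are computed.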

Immune cell analysis on intestinal tissue. Six E-Cadherin-CFP/Lgr5 
mice, 22 weeks of age, were randomly divided into two groups: a control and a 
window group to test potential side effects of AIWs. AIWs were implanted on top 
of the small intestines of mice from the window group, whereas mice from the 
control group did not undergo surgery. After 24 h, all mice were killed and the
small intestines were harvested. Note that in the window group the part of the small
intestine that was located directly behind the window was harvested. The small
intestines were fixed for 1 day in fixation mix (1% paraformaldehyde, 0.2% NaIO4,
61 mM Na2HPO4, 75 mM L-lysine and 14 mM NaH2PO4 in H2O). After fixation,
the tissues were placed for 6 h into 30% sucrose after which the tissues were snap-
frozen using Tissue Freezing Medium (Leica Microsystems Nussloch GmbH).
Sixteen-micrometre sections were cut using a Leica CM3050 cryotome. A standard 
immunohistochemistry protocol was used to stain the sections with CD45 antibodies 
(BD Pharmingen, 553078, Clone 30-F11) and random areas were imaged. For 
analysis, ten areas within the imaged regions were selected and analysed in a 
blinded manner. The number of CD45-positive cells within a region was counted 
manually and an averaged number for each mouse was calculated. Next, the 
average of the three mice per group was calculated. A Mann-Whitney U test 
was performed because the sample was not distributed normally, and no signifi- 
cant differences were found. The variance between the groups was tested with an 
F-test, and was not different. 
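
The two-level averaging described above (regions within a mouse, then mice within a group) can be sketched as below. All counts are hypothetical, the function names are illustrative, and the variance ratio is only the statistic on which an F-test is based, not the test itself.

```python
from statistics import mean, variance


def group_average(counts_by_mouse):
    """Average CD45-positive counts per mouse, then average the per-mouse means."""
    per_mouse = {mouse: mean(counts) for mouse, counts in counts_by_mouse.items()}
    return per_mouse, mean(per_mouse.values())


def variance_ratio(values_a, values_b):
    """Ratio of the larger to the smaller sample variance."""
    var_a, var_b = variance(values_a), variance(values_b)
    return max(var_a, var_b) / min(var_a, var_b)


if __name__ == "__main__":
    # Hypothetical counts for three regions per mouse (the study used ten).
    control = {"m1": [10, 12, 14], "m2": [20, 20, 20], "m3": [21, 22, 23]}
    window = {"m4": [11, 13, 15], "m5": [19, 19, 19], "m6": [20, 22, 24]}
    control_means, control_avg = group_average(control)
    window_means, window_avg = group_average(window)
    print(control_avg, window_avg)
    print(variance_ratio(list(control_means.values()), list(window_means.values())))
```

Averaging per mouse first keeps the mouse, rather than the imaged region, as the unit of comparison between groups.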

Clone frequency window versus control mice. Eight Lgr mice, 
22 weeks of age, were divided into two groups: a control and a window group. All 
mice received 5 mg tamoxifen by intraperitoneal injection. Three days later, AIWs 
were implanted on top of the small intestines of mice from the window group, 
whereas mice from the control group did not undergo surgery at this point. Two 
days after the surgery (5 days after tamoxifen injection) all mice were imaged. In 
the control group the intestine was exteriorized before imaging (as described 
earlier). In the window group the mice were imaged through the AIW. Several 
random areas were imaged. All recorded clones were used for analysis. For a single 
clone the number of cells within the stem-cell compartment was determined and a 
frequency distribution was made for the two groups. 
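
A frequency distribution of this kind can be computed as sketched below. The clone sizes are hypothetical and the function name is illustrative.

```python
from collections import Counter


def clone_size_frequencies(clone_sizes):
    """Fraction of clones having each observed size within the stem-cell compartment."""
    counts = Counter(clone_sizes)
    total = len(clone_sizes)
    return {size: counts[size] / total for size in sorted(counts)}


if __name__ == "__main__":
    # Hypothetical clone sizes recorded in one group of mice.
    print(clone_size_frequencies([1, 1, 2, 4]))
```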
Extended Data Figure 1 | Retracing of the intravital imaging fields. a, The
coordinates of the imaging fields within the imaging window (which always has
a fixed position within the stage) were stored. By applying these stored
coordinates in the subsequent imaging sessions, we recovered the same
positions. b, The vasculature was used to refine the retracing. c, The retraced
imaging areas were validated based on the relative position of the coloured
crypts. Scale bars, 20 µm.
Extended Data Figure 2 | Real-time lineage tracing of individual clones.
a, Expression of a Confetti colour was induced in Lgr5+ CBC cells, and their
progeny was followed over time. Two examples of Confetti-labelled Lgr5+ CBC
cells expanding over time are shown to illustrate clonal competition at indicated
times. The top images show the continuous expansion of a YFP-expressing
Lgr5+ CBC cell (yellow line) in the central region of the crypt. The bottom
images show the size of a clone derived from an RFP-expressing Lgr5+ CBC cell
in the central region of the crypt, which first increases and then declines
(red line). The Confetti-labelled cells are outlined by a white dashed line and the
crypts by a grey dashed line. Scale bars, 20 µm. Right, graph shows the increase
in the number of YFP- or RFP-expressing Lgr5+ CBC cells in the entire
stem-cell compartment of the crypts shown in the images on the left (yellow
and red line, respectively). b, A total of 80 crypts were imaged in four mice.
The total clone size (border and central) of Confetti+ cells present in the
stem-cell niche independent of Lgr5 expression (row 0 to +4, which translates
to z0 to z13 of the z-stack) was analysed at the indicated times (n = 4 mice) for
all 80 lineages. The first time point translates to several days after induction.
Every graph represents one crypt.
Extended Data Figure 3 | Clonal dynamics are unaffected by AIW surgery.
a, Abdominal imaging windows (AIW) were surgically implanted into the
abdominal wall of E-Cadherin-CFP/Lgr5EGFP-Ires-CreERT2 mice. To detect
CD45+ immune cells, tissue sections of the small intestine of these and control
(no AIW) mice were stained with CD45 antibodies. Scale bar, 20 µm.
b, Quantification of images in a. Regions of interest within the stromal area of
the small intestines were measured and the number of CD45+ cells was counted
within those regions. The bars show averages and s.e.m. (n = 10 regions per
mouse performed in 3 mice per condition). c, Five days after tamoxifen
injection, the clones of Lgr5EGFP-Ires-CreERT2/R26R-Confetti mice were imaged
intravitally. In control mice, the intestine was exteriorized before imaging and,
in window mice, a window was placed 2 days before imaging. The frequency
of clones with a certain size is plotted in the graph. The lines show the mean
and s.e.m. (n = 4 mice per condition).
Extended Data Figure 4 | The spatial distribution of Confetti clone
expansion within the stem-cell niche. a, b, Spatial distribution of Confetti
clones, subdivided into clones starting in the central (a) or border (b) region
(n = 4 mice). Top left panel (a) shows the relative Confetti+ cell position within
the stem-cell niche, where row 0 to +4 translates to z0 to z13 of the z-stack.
Every table shows the number of Confetti-labelled cells that are present at the
different positions in the crypt, independent of Lgr5 expression. The
position within the central and border region of the stem-cell niche and TA
compartment are colour coded. The hash symbol indicates the presence of
Confetti-labelled cells in the TA compartment. NA indicates time points for
which data were not available owing to crypts that were not retraceable at that
specific time point.
Extended Data Figure 5 | Average clone size and survival probability.
Cartoon shows the division of the intestinal stem-cell niche into a central and
border region. The central region contains rows 0 to +2, which translates to z0
to z6 of the z-stack, and the border region contains rows +3 and +4, which
translates to z7 to z13 of the z-stack (see Fig. 1b). Confetti expression was
induced in Lgr5+ CBC cells. The panels show, at the indicated time, the spatial
organization of the 80 lineages of progeny of the Confetti-labelled CBC cells
(same clones as in Extended Data Figs 2, 4) (n = 4 mice). For each lineage, we
recorded the number of cells per clone in the central and border regions of the
stem-cell niche at the indicated times. The first time point translates to several
days after induction. The asterisks indicate clones in which all progeny were lost
from the niche region.
Extended Data Figure 6 | Lgr5+ CBC cells move and can become expelled
from the stem-cell niche. Top, maximum projection images (z2 to z5) of a
time series of a crypt. In the lower cartoons, the Lgr5+ CBC cells are
highlighted. The moving cells are indicated with a different colour. The location
of the cells at earlier time points is shown by a lighter colour and arrows
indicate the direction of movement. Scale bars, 20 µm.
Extended Data Figure 7 | Cumulative clone size distributions derived from
two border mother cells. Cumulative clone size distributions of clones derived
from two neighbouring cells in the border region. In each case, the size of the
clone is defined by the total number of constituent cells in the central and
border regions (rows 0 to +4), independent of GFP expression. Note that the
cumulative clone size distribution records the fraction of clones that have a size
larger than the given value. The bars represent measurements from individual
lineages reconstructed from live imaging at days 1 (a, n = 13; b, n = 35;
c, n = 35), 2 (a, n = 10; b, n = 25; c, n = 28), 3 (a, n = 7; b, n = 14; c, n = 14)
and 4 (a, n = 6; b, n = 8; c, n = 6). Error bars denote s.e.m. The points represent
the predictions of the biophysical model using the same parameters as those
inferred from the short-term live-imaging assay (for details, see Fig. 3a and
Supplementary Notes).
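
The cumulative clone size distribution defined in this legend, that is, the fraction of clones with a size larger than a given value, can be sketched in Python as follows. The clone sizes are hypothetical and the function name is illustrative.

```python
def cumulative_clone_size_distribution(sizes, max_size=None):
    """Return, for k = 0 .. max_size, the fraction of clones with size > k."""
    n = len(sizes)
    top = max(sizes) if max_size is None else max_size
    return [sum(1 for s in sizes if s > k) / n for k in range(top + 1)]


if __name__ == "__main__":
    # Hypothetical clone sizes (cells in rows 0 to +4) at one time point.
    print(cumulative_clone_size_distribution([1, 2, 2, 4]))
```

Because the comparison is strict, the entry for the largest observed size is always zero.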
Extended Data Figure 8 | Longer-term clonal evolution and the approach to
scaling behaviour. a, b, Cumulative clone size distribution at 7 days (a) and
14 days (b) after induction, showing the percentage of clones that have a
constituent number of Lgr5+ CBC cells larger than the given value. For
example, the data point at around four cells (40%) in a shows that, at 7 days
post-induction, some 40% of clones have a size larger than four Lgr5+ cells, and
so on. The bars reproduce the findings of a fixed clonal assay using the same 
Lgr5-Confetti mouse construct used in the present study and previously 
described’. Error bars denote s.e.m. Points represent the predictions of the 
biophysical model using the same parameters as those inferred from the short- 
term live-imaging assay (see Fig. 3 and Supplementary Notes). To account for 
the prolonged activity of Cre recombinase in the static clonal assay, in both 
cases we introduced a 1-day time delay. At the 14-day time point, the measured 
clone size distribution and model prediction are beginning to converge onto the 
universal scaling behaviour characteristic of a strictly one-dimensional neutral 
drift dynamics (line) (for details, see Supplementary Notes). 
Extended Data Figure 9 | Long-term drift towards monoclonality of
labelled crypts. Predicted frequency of monoclonal crypts over time expressed 
as a percentage of surviving clones after pulse labelling of stem cells at clonal 
density. Points show the predictions of the biophysical model defined in the 
main text and Supplementary Notes using the same parameters as those 
inferred from the short-term live-imaging assay (see Fig. 3 and Supplementary 
Notes) after the representative marking of stem cells at the crypt base, and the 
line shows the predictions of the strictly one-dimensional neutral drift 
dynamics model introduced previously*, with a stem-cell loss replacement rate 
of 0.24 per day and a total of 8 stem cells. The convergence of these two model 
predictions at longer times shows that, first, the behaviour of the quasi-one- 
dimensional model approaches that of the strictly one-dimensional model at 
longer times and, second, that the effective loss/replacement rate and stem-cell 
number in the new model is essentially fixed by the rate P..j = 0.24 per day and 
the eight stem cells that occupy the central region. Significantly, these 
parameters translate to the ratio P..A/Notem? = 0.026 per week, very close to the 
figure of 0.025 per week obtained from a fit of the measured monoclonal crypt 
fraction to the one-dimensional neutral drift dynamics model in ref. 8. (For the 
labelling protocol and the experimental data points, we refer to the original 
reference.) 
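
The arithmetic behind the quoted ratio can be checked with a short sketch: a loss/replacement rate of 0.24 per day, converted to a weekly rate and divided by the square of the eight central stem cells. The function name is illustrative.

```python
def drift_ratio_per_week(rate_per_day, n_stem):
    """Loss/replacement rate per week divided by the squared stem-cell number."""
    return rate_per_day * 7 / n_stem ** 2


if __name__ == "__main__":
    # 0.24 per day and 8 stem cells, as stated in the legend above.
    print(round(drift_ratio_per_week(0.24, 8), 3))
```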
IL-35-producing B cells are critical regulators of
immunity during autoimmune and infectious diseases

Ping Shen, Toralf Roch, Vicky Lampropoulou, Richard A. O’Connor, Ulrik Stervbo, Ellen Hilgenberg, Stefanie Ries,
Van Duc Dang, Yara Jaimes, Capucine Daridon, Rui Li, Luc Jouneau, Pierre Boudinot, Siska Wilantri, Imme Sakwa,
Yusei Miyazaki, Melanie D. Leech, Rhoanne C. McPherson, Stefan Wirtz, Markus Neurath, Kai Hoehlig, Edgar Meinl,
Andreas Grützkau, Joachim R. Grün, Katharina Horn, Anja A. Kühl, Thomas Dörner, Amit Bar-Or, Stefan H. E. Kaufmann,
Stephen M. Anderton & Simon Fillatreau


B lymphocytes have critical roles as positive and negative regulators 
of immunity. Their inhibitory function has been associated prim- 
arily with interleukin 10 (IL-10) because B-cell-derived IL-10 can 
protect against autoimmune disease and increase susceptibility to 
pathogens’. Here we identify IL-35-producing B cells as key players 
in the negative regulation of immunity. Mice in which only B cells 
did not express IL-35 lost their ability to recover from the T-cell- 
mediated demyelinating autoimmune disease experimental auto- 
immune encephalomyelitis (EAE). In contrast, these mice displayed 
a markedly improved resistance to infection with the intracellular 
bacterial pathogen Salmonella enterica serovar Typhimurium as 
shown by their superior containment of the bacterial growth and 
their prolonged survival after primary infection, and upon second- 
ary challenge, compared to control mice. The increased immunity 
found in mice lacking IL-35 production by B cells was associated 
with a higher activation of macrophages and inflammatory T cells, 
as well as an increased function of B cells as antigen-presenting cells 
(APCs). During Salmonella infection, IL-35- and IL-10-producing 
B cells corresponded to two largely distinct sets of
surface-IgM+CD138hiTACI+CXCR4+CD1dintTim1int plasma cells expressing
the transcription factor Blimp1 (also known as Prdm1). During EAE,
CD138+ plasma cells were also the main source of B-cell-derived IL-35
and IL-10. Collectively, our data show the importance of IL-35- 
producing B cells in regulation of immunity and highlight IL-35 
production by B cells as a potential therapeutic target for auto- 
immune and infectious diseases. This study reveals the central role 
of activated B cells, particularly plasma cells, and their production 
of cytokines in the regulation of immune responses in health and 
disease. 

The inhibitory activities of B cells involve their production of IL-10, 
which in mice can protect from autoimmunity but impair resistance 
to infection® *. Such suppressive function could be relevant to human 
diseases. A defect in IL-10 secretion by B cells was observed in patients 
with multiple sclerosis and type 1 diabetes”*. Furthermore, B-cell deple- 
tion therapy had deleterious effects in some patients with multiple 
sclerosis or ulcerative colitis”’, and led to ulcerative colitis or psoriasis 
in patients with Graves’ disease or rheumatoid arthritis, respectively’. 
These effects were probably not all owing to a loss of IL-10-producing 
B cells. Mouse B cells could inhibit immunity independently of IL-10 
(refs 13, 14). However, no mediator to account for this has been char- 
acterized. There is an urgent need to identify additional factors medi- 
ating regulatory functions of B cells. 


B cells require activation to exert suppressive activity and Toll-like 
receptors (TLR) are critical in this process. Mice with deficiencies in 
both TLR2 and TLR4 restricted to B cells developed an exacerbated 
EAE after immunization with the encephalitogenic peptide from myelin
oligodendrocyte glycoprotein (MOG35-55)'°. Using mice with single
deficiencies in these TLR restricted to B cells (B-Tlr2−/− and B-Tlr4−/−
mice, respectively), we found that TLR4 was the most critical for B-cell-
mediated suppression in EAE (Fig. 1a and Extended Data Fig. 1a).
With our previous studies’, these results establish TLR4 and CD40 as 
receptors essential for the regulatory function of B cells in EAE. CD40 
also contributes to the protective roles of B cells in ulcerative colitis 
and arthritis**. 

IL-10 production by B cells is required for recovery from EAE’. 
Naive B cells produced IL-10 after TLR4 engagement, but not upon 
co-stimulation by TLR4 plus CD40 (Fig. 1b and Extended Data Fig. 1b). 
To identify additional suppressive factor(s) produced upon TLR4 plus 
CD40 stimulation, we performed Affymetrix array analyses on (1) naive 
B cells, (2) B cells activated by TLR4 and (3) B cells activated by TLR4 
plus CD40, and focused on genes coding for secreted molecules (Extended 
Data Fig. 1c). Among the genes differentially expressed, of interest was 
Epstein-Barr virus-induced gene 3 (Ebi3), a member of the IL-12 cyto- 
kine family that can dimerize with p28 or p35 to generate IL-27 or IL-35, 
respectively, which both have suppressive functions'*””. B cells did not 
express p40 (also known as Il12b) messenger RNA, highlighting its cell-
type specific expression pattern”, but constitutively transcribed p35
(also known as Il12a) (Extended Data Fig. 1d). In fact, B cells were the
main source of p35 mRNA in secondary lymphoid tissues (Fig. 1d).
B cells upregulated expression of p35 and Ebi3 mRNA, as well as EBI3
protein upon activation by TLR4 plus CD40, which was further
increased upon B-cell receptor for antigen (BCR) engagement, suggesting
they could secrete IL-35 (Fig. 1e and Extended Data Fig. 1). This
was confirmed by co-immunoprecipitation using supernatants from 
TLR4 plus CD40-activated B cells (Fig. 1f). We conclude that B cells 
can secrete IL-35 after activation by TLR4 plus CD40. 
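
Because the IL-12 family subunits are shared between cytokines, it can help to make the pairings explicit. The sketch below encodes only the three heterodimers named in this Letter (p35 with p40 for IL-12, p28 with EBI3 for IL-27, p35 with EBI3 for IL-35); the data structure and function name are illustrative and are not taken from the original analysis.

```python
# Heterodimers of the IL-12 family discussed in the text, keyed by cytokine.
HETERODIMERS = {
    "IL-12": frozenset({"p35", "p40"}),
    "IL-27": frozenset({"p28", "EBI3"}),
    "IL-35": frozenset({"p35", "EBI3"}),
}


def cytokines_lost(knocked_out_subunit):
    """Return the cytokines that cannot be assembled without the given subunit."""
    return {
        name
        for name, subunits in HETERODIMERS.items()
        if knocked_out_subunit in subunits
    }


if __name__ == "__main__":
    for subunit in ("p35", "EBI3", "p40", "p28"):
        print(subunit, sorted(cytokines_lost(subunit)))
    # The only cytokine lost in both p35 and EBI3 deficiency:
    print(cytokines_lost("p35") & cytokines_lost("EBI3"))
```

Under this mapping, IL-35 is the only cytokine lost in both p35 and EBI3 deficiency, whereas p40 or p28 deficiency leaves it intact.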

To evaluate the role of IL-35 expression by B cells during EAE, we
used mice with a B-cell-restricted deficiency in p35 (B-p35−/−) or EBI3
(B-EBI3−/−), or control mice with wild-type B cells (B-WT). We also used
mice in which B cells could not express p40 (B-p40−/−) or p28 (B-p28−/−),
because p35 can dimerize with p40 to form IL-12, and EBI3 can associate
with p28 to form IL-27. B cells secreted p28 after activation
(Extended Data Fig. 1j). B-p35−/− and B-EBI3−/− mice developed exacerbated
EAE, whereas B-p40−/− and B-p28−/− mice had disease courses similar
Figure 1 | B cells secrete IL-35 upon activation by TLR4 and CD40.
a, EAE was induced in B-Tlr2−/− (grey squares, n = 8), B-Tlr4−/− (black triangles,
n = 8) and B-WT mice (grey circles, n = 16). Data show clinical EAE scores from
two independent experiments (mean ± s.e.m.). Cumulative disease scores were
compared using two-tailed unpaired t-test. b, Splenic B cells from IL-10-eGFP
(knock-in of eGFP at Il10 locus) mice were stimulated for 72 h with LPS
(1 µg ml−1) or LPS (1 µg ml−1) plus anti-CD40 (10 µg ml−1), and eGFP
expression was measured by flow cytometry. Plots show eGFP expression by
live CD19+ cells. Results are representative of three independent experiments.
c, Hierarchical cluster analysis of secreted factors differentially expressed
between B cells activated with LPS or LPS plus anti-CD40 (Pearson correlation
with average linkage). Affymetrix microarrays were performed in
quadruplicate. Gene expression levels are shown for each array compared to its
average value for all arrays, within a scale from twofold increase (yellow) to
twofold decrease (blue). d, p35 mRNA expression was quantified by RT-PCR
in lymph nodes and spleen from naive C57BL/6 and B-cell-deficient JHT mice,
as well as in B cells purified from lymph nodes and spleen of C57BL/6 mice.
Data show compilation of three independent experiments (mean ± s.e.m.).
e, Splenic B cells were activated as indicated for 72 h, and treated with GolgiStop
for the last 4 h of culture. B-cell lysates were separated on SDS-PAGE gel and
blotted with anti-EBI3 or anti-actin antibody. Data show a representative
result from three independent experiments. f, B cells from C57BL/6 or p35-
deficient mice were activated for 72 h with LPS plus anti-CD40. Culture
supernatants were subjected to immunoprecipitation with anti-p35 followed by
western blot with anti-EBI3 antibody. Data shown are representative of two
independent experiments.


to B-WT controls (Fig. 2a and Extended Data Fig. 2). Therefore,


B cells limited EAE pathogenesis through provision of IL-35. EAE
pathogenesis involves TH1 and TH17 cells, which express IFN-γ and
IL-17, respectively”’”. B-p35−/− mice displayed increased MOG-reactive
IFN-γ and IL-17 production compared to control mice (Fig. 2b and
Extended Data Fig. 2d). In contrast, B-p40−/− and B-p28−/− mice mounted
normal T-cell responses (Fig. 2b and Extended Data Fig. 2). B-p35−/− mice
had more CD4+ T cells and mononuclear phagocytes in the central
nervous system than B-WT mice (Extended Data Fig. 2), suggesting that
B-cell-derived IL-35 limited disease by reducing the accumulation of
pathogenic cells in the target organ. These data demonstrate that B-cell-
derived IL-35 is a critical regulator of T-cell-mediated autoimmunity.

The mechanisms underlying the suppressive activities of IL-35 remain
poorly understood’. During EAE, the increased T-cell response observed
in B-p35−/− mice was not due to a defect in CD4+Foxp3+ T regulatory
(Treg) cells (Extended Data Fig. 2), which are protective in this disease™*.

B cells from B-p35−/− mice expressed higher levels of activation markers
(CD44, CD69) and molecules involved in antigen presentation to CD4+
Figure 2 | IL-35 expression by B cells is required for recovery from EAE.
a, EAE was induced in B-p35−/− (grey squares, n = 17) and B-WT mice
(black circles, n = 16) (left panel); B-EBI3−/− (grey squares, n = 9) and
corresponding B-WT mice (black circles, n = 5) (middle panel); B-p40−/−
(grey squares, n = 10) and B-WT mice (black circles, n = 16) (right panel) by
immunization with MOG35-55 peptide in Complete Freund’s adjuvant. Data
show clinical EAE scores from two independent experiments (mean ± s.e.m.).
Cumulative disease scores were compared using a two-tailed unpaired t-test.
b, Splenocytes were collected from mice on day 10 after EAE induction,
and pooled before re-stimulation for 48 h with MOG35-55 in increasing
concentrations. Culture supernatants were analysed by ELISA to determine
IFN-γ and IL-17 concentrations. Data show a representative result from two
independent experiments. c, EAE was induced in B-p35−/− and corresponding
B-WT mice by immunization with MOG35-55 peptide in Complete Freund’s
adjuvant. B cells and CD4+CD25− T cells (Teff) were isolated from pooled
draining lymph nodes and spleens on day 10 after immunization. Then 5 X 10° 
B cells from B??° ” or B™ mice were cultured with 1 X 10‘ Teff cells from 
BP>” or BT mice in the presence of MOG35-55 in increasing concentrations, 
as indicated. Proliferation was assessed after 64 h by [3H]thymidine
incorporation. CPM, counts per minute. Data show representative results from
two independent experiments. d, Supernatants from cultures as described in
c were harvested after 48 h, and analysed by Bio-Plex to determine the
concentrations of IL-17 and GM-CSF. Data shown (mean ± s.e.m.) are pooled
from two independent experiments. b-d, Graphs show mean ± s.e.m. Results
were compared using a two-way ANOVA followed by a Bonferroni post-test
(*P < 0.05, **P < 0.01; ***P < 0.001). Results of comparison are shown for
BP?” versus BY” (b), and for B“" B cells plus BY” Teff versus B??> ” 

B cells plus B”” Teff (d). 




T cells (MHC-II, CD80, CD86), compared to control B cells (Extended
Data Fig. 3). Accordingly, they were more potent APCs, stimulating
higher proliferation and production of inflammatory cytokines (IL-17
and GM-CSF) by MOG-reactive CD4+ T cells than control B cells
(Fig. 2c, d and Extended Data Fig. 3c). Preliminary studies showed
that B-p35−/− mice were susceptible to EAE induced with human MOG, a
disease dependent on pathogenic functions of B cells”. These findings
introduce IL-35 as a regulator of the APC function of B cells.

To test the role of B-cell-derived IL-35 in infection, we challenged
B-p35−/−, B-EBI3−/− and B-p40−/− mice with the Gram-negative bacterium
Salmonella enterica serovar Typhimurium (hereafter referred to as
Salmonella). This intracellular pathogen causes a disease in mice that
resembles typhoid fever in humans, responsible for approximately
20 million cases and 600,000 deaths annually”. B-p35−/− and B-EBI3−/−
mice displayed longer survival than B-p40−/− and B-WT mice after primary
infection, and upon secondary challenge (Fig. 3a). This improved
resistance correlated with better control of the bacterial burden in spleen 
and liver, increased accumulation of macrophages in these organs, and 
stronger inflammatory T-cell responses towards Salmonella (Fig. 3b, c 
and Extended Data Fig. 4). In contrast, B-cell-derived IL-35 had no 
effect on Treg frequencies, global frequencies of activated T cells or 
humoral immunity against Salmonella (Extended Data Figs 4 and 5a). 
B-p35−/− mice also mounted normal antibody responses against a hapten-
protein antigen (Extended Data Fig. 5b). Consistent with the role of
CD40 in IL-35 production by B cells (Fig. 1), mice with a B-cell-restricted 
deficiency in CD40 displayed enhanced control of Salmonella infection 
(Extended Data Fig. 5c). These data demonstrate that B cells can inhibit 
anti-microbial immunity through production of IL-35. 

B cells can also inhibit anti-Salmonella immunity through IL-10 (ref. 6).
To identify IL-10- and IL-35-producing B cells, and clarify their relationship,
we quantified Ebi3 and Il10 mRNA in CD19+CD138− B cells
and CD138hi plasma cells during Salmonella infection (Fig. 4a and
Extended Data Fig. 6). Ebi3 and Il10 were exclusively induced in
CD138hi cells (Fig. 4a). Single-cell PCR analyses indicated that distinct
sets of CD138hi cells expressed the mRNA for Il10 or for both IL-35
subunits Ebi3 and p35 (Fig. 4b). We therefore characterized these
plasma cells further. They expressed uniform surface levels of IgM, CD80, CD86,
MHC-II, CD40, CD69, CD44, CD43, TACI and CXCR4, as well as
intermediate levels of CD1d (also known as Cd1d1) and Tim1 (also
known as Havcr1) (Extended Data Fig. 6a, d), yet three subsets could be
distinguished according to CD138 and CD22 levels: CD138intCD22+,
CD138hiCD22+ and CD138hiCD22− cells (Extended Data Fig. 7). These
subsets differed by their capacity to produce antibodies, and expressed
distinct amounts of mRNA for the transcription factors driving plasma
cell development (Blimp1, IRF4), or maintaining B-cell identity (Pax5)
(Extended Data Fig. 7), demonstrating that they corresponded to different
stages of plasma cell development. The expression levels of Il10
and Ebi3 mRNA in these subsets correlated with their degree of maturity,
and were highest in the most differentiated CD138hiCD22− cells
(Extended Data Fig. 7c). CD138hiCD22− plasma cells were mostly
located at the interface between red and white pulp in spleen, in clusters
also containing T cells, CD11b+ and CD169+ myeloid cells (Extended
Data Fig. 7d). Single-cell PCR analyses revealed that 6-10% of CD138hi
cells expressed Il10 mRNA, and a similar frequency made mRNA for
both IL-35 subunits, whereas few cells co-expressed these three transcripts
together (Fig. 4b). Nearly all CD138hi cells transcribing Il10 or
both IL-35 subunits co-expressed Blimp1 (Fig. 4c), as expected for
plasma cells. The less mature CD138intCD22+ population contained
little Il10 or Ebi3 mRNA (Extended Data Fig. 7c), and only rare cells
contained Il10 mRNA (Fig. 4b), suggesting that expression of IL-10
and both IL-35 subunits was acquired during plasma cell maturation.
These data indicate that distinct sets of plasma cells provide IL-10 and
IL-35 during Salmonella infection.

Plasma cells from mice infected with Salmonella consistently secreted
more IL-10 than B cells upon ex vivo stimulation (Fig. 4d and Extended
Data Fig. 8). Around 70% of CD138hi plasma cells upregulated IL-10
expression after stimulation with phorbol 12-myristate 13-acetate plus
ionomycin, a classical treatment for identifying cytokine-producing
cells (Extended Data Fig. 8). CD138+ plasma cells were also the major
B-cell subtype expressing the proteins EBI3 and p35 (Fig. 4e). In contrast,
CD138hi plasma cells did not secrete IL-6, a mediator of pro-inflammatory
functions of B cells”’”, unlike CD19+CD138− B cells (Extended Data


368 | NATURE | VOL 507 | 20 MARCH 2014 


[Figure 3 graphic. a, Survival versus days post-infection (top) and days post-rechallenge (bottom); b, flow cytometry plot (Ly6C axis) and mononuclear phagocyte frequencies; c, flow cytometry plots (IFN-γ axis) and frequencies of IFN-γ+ and CD40L+ CD4+ T cells.]

Figure 3 | B-cell-derived IL-35 enhances susceptibility to Salmonella.
a, Top panel shows survival curves of B-p35−/− (n = 14) and corresponding
B-WT mice (n = 16), B-EBI3−/− (n = 12) and corresponding B-WT mice (n = 13),
B-p40−/− (n = 13) and corresponding B-WT mice (n = 14) after infection with
virulent Salmonella (SL1344). Data are pooled from two independent experiments.
Survival curves were compared using Wilcoxon test. Bottom panel shows
survival curves of B-p35−/− (n = 15) and corresponding B-WT mice (n = 16),
B-EBI3−/− (n = 15) and corresponding B-WT mice (n = 15), B-p40−/− (n = 11) and
corresponding B-WT mice (n = 14) vaccinated with attenuated Salmonella
(SL7207) and 90 days later re-challenged with virulent Salmonella (SL1344).
Data are pooled from two independent experiments. Survival curves were
compared using Wilcoxon test. b, Representative FACS plot of mononuclear
phagocytes (MP) gated as CD11b+Ly6Chi cells among live B-WT splenocytes
at day 6 post-infection (p.i.) with SL1344 (left panel); frequencies of MP
per spleen at day 6 p.i. in B-p35−/−, B-EBI3−/− and B-p40−/− mice together with
their corresponding B-WT controls (right panel). Numbers of mice: B-p35−/−
(n = 6) and B-WT (n = 8), B-EBI3−/− (n = 7) and B-WT (n = 5), B-p40−/− (n = 8)
and B-WT (n = 8). Data are pooled from two independent experiments.
Graphs show mean ± s.e.m. c, Mice were infected with attenuated Salmonella
(SL7207). After 21 days, bone marrow cells were stained for CD4 and CD40L
or IFN-γ after 6 h re-stimulation with heat-killed Salmonella. Representative
FACS plots of IFN-γ+ cells among CD4+ T cells from B-WT and B-p35−/− mice
(left panel). Frequencies of IFN-γ+ and CD40L+ cells among CD4+ T cells
(middle and right panels, respectively). Data are pooled from two independent
experiments with total number of mice: B-p35−/− (n = 12) and corresponding
B-WT (n = 10), B-EBI3−/− (n = 8) and corresponding B-WT (n = 9), B-p40−/− (n = 15)
and corresponding B-WT (n = 9). b, c, Data were analysed using two-tailed
unpaired t-test with Welch’s correction in case of unequal variance. P values
>0.05 are considered not significant (NS).


Fig. 8). This lack of IL-6 production, which may reflect a repressive
effect of Blimp1 (ref. 28), distinguishes IL-10- and IL-35-expressing
plasma cells from IL-10-producing CD1dhi B cells (Extended Data
Fig. 8). CD138hi plasma cells therefore have a remarkable propensity
to express anti-inflammatory cytokines but not IL-6 during Salmonella
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Figure 4 | IL-10 and IL-35 are expressed by CD138" plasma cells during 
Salmonella infection. a, Splenic plasma cells (PC) (CD138") and B cells 
(CD19*CD138' ) were isolated from C57BL/6 mice on days 0, 1, 3, 5, and 8 
after infection with 10’ c.f.u. attenuated Salmonella (SL7207). Ebi3 and I110 
mRNA expression was quantified by RT-PCR. Data show fold induction 
of Ebi3 (left) and 110 (right) in PC and B cells compared to naive B cells. 
A compilation of five independent experiments is shown (mean = s.e.m.). 
b, Single CD138intCD22+, CD138hiCD22+, and CD138hiCD22− cells were 
sorted by FACS from C57BL/6 mice on day 3 after infection with 10’ c.f.u. 
attenuated Salmonella (SL7207). A total of 208 CD138intCD22+, 206 
CD138hiCD22+ and 189 CD138hiCD22− cells expressed β-actin, and were 
included for analysis. Data show percentages of Il10+ (left), p35+Ebi3+ 
(middle), and Il10+p35+Ebi3+ (right) cells among each subset. c, Blimp1 
mRNA expression in cells analysed in (b) was also detected by single-cell PCR. 
Data show percentages of Blimp1+ cells among Il10+ (left), and p35+Ebi3+ 
cells (right). d, CD138" PC and CD19+CD138− B cells were isolated from 
spleens of C57BL/6 mice on day 3 after infection with attenuated Salmonella 
(10’ c.f.u.). Naive splenic B cells were isolated from unchallenged C57BL/6 
mice. Cells were activated for 24h with PMA/ionomycin and IL-10 was 
determined in culture supernatants by Bio-Plex. Data shown are pooled from 
5 independent experiments. Results were compared using a two-tailed 
unpaired t-test. e, Splenic CD19+CD138− B cells and CD138+ plasma cells 
were isolated from spleens of C57BL/6 (WT) and p35−/−Ebi3−/−p40−/− 
mice on day 3 p.i. with attenuated Salmonella (10’ c.f.u.). Proteins were 
separated on SDS-PAGE gel and detected with anti-EBI3, anti-p35 or 
anti-actin antibodies. Data show results from two independent preparations 
for WT samples, and one preparation for p35−/−Ebi3−/−p40−/− B and 
plasma cells. 


infection, emphasizing the regulatory potential of plasma cells com- 
pared to other B cell subsets. 

Our study identifies IL-35-producing B cells as critical regulators of 
immunity. At a time window during Salmonella infection when both 
B-cell-derived IL-10 (ref. 6) and IL-35 exerted suppressive functions, 
plasma cells were the major B-cell type expressing these cytokines. This 
was also the case during EAE (Extended Data Fig. 9). During Salmonella 
infection, IL-10 and IL-35 were made by distinct sets of plasma cells, 
suggesting that these two suppressive axes can operate in parallel. In 
line with this, mice in which individual B cells could express either 
IL-10 or IL-35, but not both cytokines, displayed a normal EAE course 
(Extended Data Fig. 9). Accordingly, B and plasma cells could produce 
IL-10 without concomitant IL-35 secretion (Extended Data Fig. 9). 
Future studies shall assess whether ‘regulatory plasma cells’ producing 
IL-10 and IL-35 (but not IL-6) can provide novel opportunities for 
immune intervention. 


METHODS SUMMARY 

Ble pire pss Bevis’ pedo” p28 and BYT mice were generated 
using a previously described mixed bone marrow chimaera approach”, as detailed 
in the Methods, and outlined in Extended Data Fig. 10. This protocol allowed normal 
reconstitution of the B-cell compartment in the resulting chimaera (Extended Data 
Fig. 10). EAE was induced in mice by immunization with MOG35–55 emulsified in 
complete Freund’s adjuvant (Sigma-Aldrich). The disease score and the MOG35–55- 
specific T-cell responses (at days 10 and 28) were assessed as described previously’. 
Mouse B cells were purified and activated as described before’’. Cytokine secretion 
was determined by Bio-Plex or ELISA. Whole transcriptome analyses were per- 
formed with RNA purified from naive and activated B cells, using GeneChip 
Mouse Genome 430 2.0 Arrays. For mRNA quantification, total RNA was isolated 
with TRIzol (AMS Biotechnology) and cDNA was prepared with the Reverse Trans- 
cription Kit (Promega). Quantitative RT-PCR was performed with LightCycler 
FastStart DNA Master SYBR Green I (Roche). Western blots to determine EBI3, 
p35 and actin expression were performed using anti-EBI3 (rabbit M75 polyclonal 
Ab), anti-p35 (rabbit EPR5736 polyclonal Ab), and anti-actin (rabbit I-19 poly- 
clonal Ab), respectively. For immunoprecipitation, IL-35 was captured with anti- 
p35 antibody (clone C18.2; eBioscience) followed by precipitation using µMACS 
Protein G microbeads (Miltenyi Biotech). Infections with virulent and attenuated 
Salmonella (strains SL1344 and SL7207, respectively) were performed as pub- 
lished®. Flow cytometry analyses were performed as previously described®. Other 
protocols are described in the Methods. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 

METHODS 

Mice, immunization and infection. C57BL/6, Tlr2−/−, Tlr4−/−, Ebi3−/−, p28−/−, 
p35−/−, p40−/−, Cd40−/−, Il10−/−, Il10−/−p35−/−, p35−/−Ebi3−/−p40−/−, JHT, 
IL-10-eGFP, and p35−/− IL-10-eGFP mice were bred under specific pathogen- 
free conditions. B!"2", BU”, BPs pes Reb peso” pcddo’” ang 
BO mice were obtained by a previously described mixed bone marrow chi- 
maera approach using lethally irradiated C57BL/6 or JHT mice as recipients*°. 
Briefly, B? 35" mice were obtained by reconstituting recipient mice with a mixture 
of bone marrow cells from B-cell-deficient JHT mice (80%) and p35-deficient mice 
(20%). Control B“ chimaera were obtained using a mixture of bone marrow cells 
from JHT mice (80%) and C57BL/6 mice (20%) for experiments shown in Fig. 1a, 
Fig. 2a left panel, and Fig. 2a, right panel. For all other experiments (including 
Fig. 2a middle panel, and Extended Data Fig. 2a), corresponding B” control 
chimaera were obtained as outlined in Extended Data Fig. 10, by reconstituting 
irradiated mice with a mixture of bone marrow cells from C57BL/6 mice (80%) and 
from mice deficient in the gene of interest (20%). The various control chimaera are 
called ‘corresponding B” mice’ throughout the study. EAE was induced by 
immunization with MOG35–55 peptide, or recombinant human MOG extracellular 
domain emulsified in Complete Freund’s adjuvant (Sigma-Aldrich) and pertussis 
toxin (Sigma-Aldrich), and assessed as previously described’. Mice with a weight 
loss >20% were humanely euthanized according to ethical regulations. To measure 
the MOG-reactive T cell response, 8 × 10° splenocytes were stimulated in flat- 
bottomed 96-well plates with different concentrations of MOG35–55 peptide for 
2 days, and culture supernatants were then analysed by ELISA to determine 
concentrations of IFN-γ and IL-17 (coating/detection with clones R4-6A2 and XMG1.2 
for IFN-γ, and TC11-18H10 and TC11-8H4 for IL-17). Mice were infected intra- 
venously with 100 c.f.u. Salmonella strain SL1344, and 10° or 10’ c.f.u. attenuated 
Salmonella strain SL7207, and assessed as previously described, including for 
histological analyses*. All experiments were reviewed and approved by appropriate 
institutional review committees (University of Edinburgh ethical review commit- 
tee, Comite d’Ethique Midi-Pyrenees, and LAGeSo Berlin), and were conducted in 
accordance with UK, French and German legislations, in compliance with European 
community council directive 86/609/EEC guidelines. Mice were of C57BL/6 strain, 
6–12 weeks old at the start of experiments, and of both sexes. EAE 
experiments and Salmonella infections were performed in a blinded manner, and 
identities of the mice were revealed upon termination of the experiment. No 
randomization was used. Estimation of group sizes was based on our previous 
experience with these disease models, without a priori determination by power 
calculation. 

B-cell purification and activation. B cells were obtained by magnetic isolation 
using negative selection with anti-CD11b, anti-CD11c, and anti-CD43 microbe- 
ads (Miltenyi Biotec). B cells (>99% pure) were activated as previously described’ 
at 5 X 10° cells per well in 96-well flat-bottom plates with LPS (Escherichia coli 
serotype 055:B5; Sigma-Aldrich), CpG-ODN-1826 (TIB MolBiol, Germany), PGN 
(Streptomyces species 79682; Sigma-Aldrich), agonistic anti-CD40 antibody (clone 
FGK-45, produced in house), mouse CD40L-expressing L47 cells (L47-CD40L"*), 
or control L5 cells (L5-ctrl) as indicated. Culture supernatants were collected at 
72h, and IL-10 concentrations measured by Bio-Plex (Bio-Rad). For the micro- 
array experiments, B cells purified as described above were further depleted of 
possible contaminants by another round of magnetic negative selection after re- 
labelling with anti-CD11b, anti-CD11c, anti-CD43, anti-CD90, and anti-DX5 
microbeads (Miltenyi Biotec). B cells were activated with LPS (1 µg ml−1) or 
LPS (1 µg ml−1) plus anti-CD40 (10 µg ml−1). Dead cells were eliminated from 
the 24h and 72h activated culture by labelling with propidium iodide (PI), and 
sorting on FACS Diva (BD Biosciences). Cells were lysed in RLT buffer (Qiagen) 
and total RNA was extracted using RNeasy Mini Kit (Qiagen). 

Gene array hybridization and data analysis. cRNA was hybridized on Affymetrix 
MG 430 2.0 arrays, using the standard Affymetrix protocol after quality control with 
an Agilent 2100 Bioanalyzer and quantification with a NanoDrop ND-1000 spectro- 
photometer, as previously described (ref. 29). The significantly differentially regulated 
genes were detected using a t-test-based R script, with P values adjusted using the 
Benjamini–Hochberg procedure. To be selected in a comparison of two 
conditions, each Affymetrix ID had to fulfill the following criteria: (1) be present in 
at least three of the four arrays for at least one of the two conditions compared, (2) 
have a mean signal intensity higher than 50 in at least one of the two conditions, 
and (3) show an adjusted P value <0.01 (t-test) in the comparison of the two 
conditions. The genes differentially expressed between TLR4-activated and TLR4- 
activated plus CD40-activated B cells (t-test; P< 0.01), and differentially modu- 
lated during B-cell activation (t-test; P< 0.01), were then selected and filtered using 
the gene ontology resource (http://www.geneontology.org) to focus on secreted 
molecules (Extended Data Fig. 1c). Seven genes fulfilled these criteria, among which 
five were uniquely increased in TLR4 plus CD40-stimulated B cells (Extended Data 
Fig. 1c). Hierarchical clustering was performed with the MeV program (version 
4.8.1; ref. 30) using Pearson correlation and average linkage. 
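
As an illustration of the selection procedure described in this section, the following Python sketch applies a Benjamini-Hochberg adjustment to a list of raw P values and then checks the three stated criteria for a single probe set. It is not the R script used in the study: the function names, argument names and input layout (present-call counts and mean signal intensities per condition) are assumptions made for this example only.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted P values, in the input order."""
    m = len(p_values)
    # Indices of the P values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down so that adjusted values stay monotone.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


def passes_selection(present_a, present_b, mean_a, mean_b, adj_p,
                     min_present=3, min_signal=50.0, alpha=0.01):
    """Check the three criteria for one probe set (hypothetical input layout).

    present_a / present_b: number of arrays (out of four) with a present call
    mean_a / mean_b: mean signal intensity in each condition
    adj_p: adjusted P value for the comparison of the two conditions
    """
    present_ok = max(present_a, present_b) >= min_present  # criterion (1)
    signal_ok = max(mean_a, mean_b) > min_signal            # criterion (2)
    significant = adj_p < alpha                              # criterion (3)
    return present_ok and signal_ok and significant
```

In this sketch a probe set is kept only if all three checks hold; the thresholds default to the values given in the text (three of four arrays, signal above 50, adjusted P below 0.01).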

Analysis of mRNA expression by B and plasma cells. Sorted B and plasma cells 
were lysed in TRIzol, and RNA was prepared (AMS Biotechnology). After DNase 
treatment (Ambion), RNA was reverse-transcribed with a Reverse Transcription 
System (Promega). Quantitative RT-PCR was performed on an MX3005P QPCR 
System (Stratagene), with LightCycler FastStart DNA Master SYBR Green I (Roche). 
Transcripts were quantified using β-actin as standard, and the following forward 
(FP) and reverse (RP) primers (MWG Biotech): β-actin FP: 5'-TGGAATCCTG 
TGGCATCCATGAAAC-3’, β-actin RP: 5’-TAAAACGCAGCTCAGTAACAG 
TCC-3’; EBI3 FP: 5’-CGGTGCCCTACATGCTAAAT-3’, EBI3 RP: 5'-GCGGAG 
TCGGTACTTGAGAG-3’; p35 FP: 5’- CATCGATGAGCTGATGCAGT-3’, p35 
RP: 5’-CAGATAGCCCATCACCCTGT-3’; IL-10 FP: 5'-AGCCGGGAAGAC 
AATAACTG-3’, IL-10 RP: 5’-CATTTCCGATAAGGCTTGG-3’; Blimp1 FP: 
5'- GGCATTCTTGGGAACTGTGT-3’; Blimp1 RP: 5’- GACAGAGGCCGAGT 
TTGAAG-3’; IRF4 FP: 5’- GCAGCTCACTTTGGATGACA-3’; IRF4 RP: 5’-CC 
AAACGTCACAGGACATTG-3’; Pax5 FP: 5’-AACTTGCCCATCAAGGTGTC-3’; 
Pax5 RP: 5'-CTGATCTCCCAGGCAAACAT-3’. 
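
The text states that transcripts were quantified using β-actin as standard, and the legend of Fig. 4 reports fold induction relative to naive B cells, but the exact formula is not given. One common way to compute such a value from threshold cycles is the 2^(-ΔΔCt) calculation sketched below in Python. This is an assumption for illustration, not the authors' stated method; it presumes roughly 100% amplification efficiency, and all names are hypothetical.

```python
def fold_induction(ct_target_sample, ct_ref_sample,
                   ct_target_control, ct_ref_control):
    """Relative expression by the 2^(-ddCt) calculation.

    ct_target_*: threshold cycle of the gene of interest (e.g. Ebi3)
    ct_ref_*:    threshold cycle of the reference gene (e.g. beta-actin)
    'sample' is the population of interest, 'control' the baseline
    population (e.g. naive B cells). Assumes ~100% PCR efficiency.
    """
    delta_sample = ct_target_sample - ct_ref_sample
    delta_control = ct_target_control - ct_ref_control
    return 2.0 ** -(delta_sample - delta_control)
```

For instance, a target that crosses threshold three cycles earlier in the sample than in the control, with identical reference-gene cycles, corresponds to an eightfold induction under these assumptions.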

Western blot and immunoprecipitation. Activated B cells were treated with 
GolgiStop (BD Biosciences, Germany) to block protein secretion during the last 
4h of stimulation. B cells (2 × 10”) were lysed with 500 µl RIPA buffer (Thermo 
Fisher Scientific, USA) supplemented with protease inhibitors (Thermo Fisher 
Scientific, USA). Cells isolated from mice infected with Salmonella were directly 
lysed in RIPA buffer containing protease inhibitors. Protein concentrations of 
lysates were determined using the BCA Protein Assay Kit (Thermo Fisher Scientific, 
USA). Proteins were separated on a polyacrylamide gel and transferred to a PVDF 
membrane (Bio-Rad Laboratories, USA) using semi-dry blotting. EBI3, p35 and 
actin were detected using rabbit anti-EBI3 (M-75 polyclonal IgG, Santa Cruz 
Biotechnology, USA), rabbit anti-p35 (EPR5736 polyclonal IgG, Abcam, UK), 
or rabbit anti-actin (I-19 polyclonal Ab, Santa Cruz Biotechnology, USA) primary 
antibody, and horseradish peroxidase (HRP)-conjugated secondary anti-rabbit 
antibody (cat. number: 81-6120, Invitrogen, USA; or cat. number: 111-035-144, 
Jackson ImmunoResearch, USA) with ECL (GE Healthcare, UK) as HRP substrate. 
The chemiluminescence signal was measured using the Image-Reader LAS-3000 
(Fujifilm, Japan). For immunoprecipitation, supernatants from B cells activated 
with LPS plus anti-CD40 (clone FGK-45, 10 µg ml−1) were incubated overnight 
at 4 °C with 2 µg ml−1 anti-p35 (C18.2, eBioscience). Immunoprecipitation was 
performed using µMACS Protein G Microbeads (Miltenyi Biotech), followed by 
immunoblot to detect EBI3. 

B-T cell co-cultures. The protocol for B-T cell co-cultures was adapted from a 
previous report”. Briefly, B cells were magnetically sorted from pooled spleens and 
lymph nodes of B??> or BY mice on day 10 post-EAE induction as CD19* 
cells (~98% pure), and Teff cells were FACS-sorted from the CD19-depleted 
fraction as previously described?!. 50 X 10* B cells and 1 X 10* CD4*CD25~ 
T cells (Teff) were then co-cultured in the indicated combinations in presence 
of increasing concentrations of MOG35–55. After 48h cultures received 1 µCi 
[3H]thymidine, and [3H]thymidine incorporation was measured 16h later with 
a Top-Count NXT liquid scintillation counter (Perkin Elmer). Before addition of 
[3H]thymidine, samples of culture supernatants were collected to quantify con- 
centrations of IL-17, IFN-γ, GM-CSF and IL-6 using Bio-Plex (Bio-Rad). 

Plasma cell purification. Plasma cells and B cells were obtained from C57BL/6, 
p35−/−, p35−/−Ebi3−/−p40−/−, IL-10-eGFP, and p35−/− IL-10-eGFP mice on 
day 3 after infection with 10” colony forming units (c.f.u.) Salmonella (SL7207) 
by magnetic isolation using anti-CD138-PE (clone 281-2, BD Pharmingen) and 
anti-PE microbeads (Miltenyi Biotec). The negative fraction was then subjected 
to FACS sorting to obtain high purity CD19+CD138− B cells. The positive fraction 
was then stained for CD22 (clone OX-97, BioLegend), and subjected to FACS 
sorting to obtain high purity plasma cells (CD138"), and plasma cell subsets 
(CD138intCD22+, CD138hiCD22+ and CD138hiCD22− cells). For western blot, 
CD138hi and CD138int cells were isolated as CD138+ plasma cells from infected 
mice by magnetic isolation using anti-CD138-PE (clone 281-2, BD Pharmingen) 
and anti-PE microbeads (Miltenyi Biotec). 

Single-cell PCR analysis. Single cells were sorted on a FACS Aria II (BD Biosciences) 
into a 96-well PCR plate, immediately frozen in liquid nitrogen and stored at 
—80°C until further use. For detection of respective transcripts a two-step PCR 
approach was used. Reverse transcription and the first PCR step were carried out in 
a one-step reaction using the Qiagen OneStep RT-PCR kit according to the man- 
ufacturer’s instructions. As recommended in these instructions, specific nested 
primers (MWG Biotech) were used as follows: EBI3nested FP: 5’-CCTTCATT 
GCCACTTACAGG-3’, EBI3nested RP: 5'-TAATCTGTGAGGTCCTGAGC-3’; 
p35nested FP: 5’'-CATTCTAGACAAGGGCATGC-3’, p35nested RP: 5’-GTGAT 
GGGAGAACAGATTCC-3’; IL-10nested FP: 5’-TCTTACTGACTGGCATGAG 
G-3'; IL-10nested RP: 5’-CTTCTACCAGGTAAAACTGG-3’; Blimp1nested FP: 
5'-CGTGAAGTTTCAAGGACTGG-3’; Blimp1nested RP: 5'-GITGGTGGAAC 
TCCTCTCTGG-3’. For validation of sorting, β-actin primers were added to the 
reaction mixture. After this first reaction, an aliquot of the PCR product was loaded 
on an agarose gel, and only β-actin-positive samples were considered for further 
analysis. A 100-fold dilution of the PCR product was subsequently used as template 
for the second PCR reaction using the primers described in the section: analysis of 
mRNA expression by B and plasma cells. Amplification of the respective transcript 
was verified on an agarose gel. 

Histology. For immunostaining, 1- to 2-µm sections of formalin-fixed, paraffin- 
embedded tissue were cut, deparaffinized and subjected to a heat-induced epitope 
retrieval step. Slides were rinsed in cool running water and washed in Tris-buffered 
saline (pH 7.4) before incubation with primary antibodies against CD3 (cat number 
N1580, Dako, Glostrup, Denmark, dilution 1:10), and F4/80 (clone BM8, eBioscience, 
dilution 1:50) for 30 min, followed by biotinylated donkey anti-rat or donkey anti- 
rabbit (cat. numbers 712-065-153 and 711-065-152, Dianova, Hamburg, Germany) 
secondary antibodies. The streptavidin-AP kit (cat. number K5005, Dako) was 
used for detection. As negative control, primary antibody was omitted. 
Detection of serum antibodies by ELISA. To quantify Salmonella-specific or 
NP-OVA-specific antibodies, diluted sera were incubated on 96-well plates coated 
with 5 X 10° heat-killed S. typhimurium per well or 50 pg ml~' NP(15)-BSA (Bio- 
search Technologies), respectively. Detection was done with alkaline phosphatase- 
conjugated anti-IgM (cat number 1020-04), anti-IgG (cat number 1030-04), anti-IgG1 
(cat number 1070-04), anti-IgG2b (cat number 1090-04), anti-IgG2c (cat number 
1079-04), and anti-IgG3 (cat number 1100-04) antibodies (all from Southern Bio- 
technology Associates). 

Detection of mouse IL-27 p28 subunit by ELISA. The concentration of mouse 
IL-27 p28 subunit was quantified using the mouse IL-27 p28 Quantikine ELISA Kit 
(cat number M2728, R&D Systems), according to the manufacturer’s instructions. 
Isolation of CNS-infiltrating cells. CNS single-cell suspension was obtained by 
collagenase digestion of brain and spinal cord. Infiltrating leukocytes were sepa- 
rated from debris and tissue cells using Ficoll centrifugation. Cell numbers were 
determined by FACS. Cells were characterized by flow cytometry after staining for 
CD4 (clone RM4-5), CD11b (clone M1/70), Ly6G (clone 1A8) and Foxp3 (clone 
FJK-16s). 

Treg suppression assay. CD4*CD25" T cells (Treg) were isolated from B? ees 
and B“" mice on day 10 post-EAE induction, and compared for their capacity to 
suppress the proliferation of CD4*CD25~ T cells (Teff) isolated from naive 
C57BL/6 mice in vitro, as previously described*’. 

ELISPOT Assay. Sorted B cells (CD19+CD138−), and plasma cell subsets 
(CD138intCD22+, CD138hiCD22+, and CD138hiCD22− cells) or total plasma 
cells (CD138") were seeded at a starting number of 10° cells per well, with seven 
successive threefold serial dilutions, in 96-well flat-bottom EIA/RIA high-binding 
plates (Millipore) pre-coated with anti-mouse Ig(H+L) chain (5 µg ml−1; Southern 
Biotechnology Associates, cat number 1010-01). After 3 h incubation, plates were 
washed, and incubated with alkaline phosphatase-conjugated anti-IgM or anti- 
IgG antibodies overnight at 4 °C (Southern Biotechnology Associates, cat numbers 
1020-04 and 1030-04). ELISPOT were then developed using BCIP/NBT substrate 
(Gene Tex Inc.). 
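
The plating scheme above (a starting cell number followed by seven successive threefold dilutions) can be sketched in Python as follows. The starting number is left as a parameter because its exponent is not legible in this copy of the text; the function name and defaults are hypothetical.

```python
def serial_dilution(start_cells, fold=3, n_dilutions=7):
    """Cells per well for the starting well plus n successive dilutions.

    Returns n_dilutions + 1 values: the undiluted starting well followed
    by each successive 'fold'-fold dilution.
    """
    return [start_cells / fold ** step for step in range(n_dilutions + 1)]
```

With seven dilutions the series spans eight wells, the last of which holds 1/3^7 (1/2,187) of the starting cell number.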

Stimulation and characterization of plasma cells and B cells. Purified plasma 
cells and B cells were seeded at 5 × 10° cells per well on 96-well flat-bottom plates 
and stimulated with PMA plus ionomycin, LPS (1 µg ml−1; Escherichia coli serotype 
055:B5; Sigma-Aldrich), agonistic anti-CD40 antibody (10 µg ml−1; clone 
FGK-45, produced in house), anti-IgM (5 µg ml−1; cat number 115-006-075, 
Jackson ImmunoResearch Laboratories, USA), IL-4 (20 ng ml−1; R&D Systems 
GmbH), IL-5 (20 ng ml−1; eBioscience), IL-6 (20 ng ml−1; R&D Systems GmbH), 
IL-21 (20 ng ml−1; R&D Systems GmbH), and indicated combinations. After 24h, 
supernatants were harvested for the detection of IL-10 and IL-6 concentration by 
Bio-Plex (Bio-Rad, USA); cells were subjected to flow cytometry for the quantifica- 
tion of GFP-positive cells. Characterizations were done by flow cytometry after stain- 
ing for CD19 (clone 1D3, BD Pharmingen), CD138 (clone 281-2, BD Pharmingen), 
IgM (cat number 1020-02, SouthernBiotech), IgD (clone 217-170, BD Pharmingen), 
CD80 (clone 16-10A1, BD Pharmingen), CD86 (clone GL1, BD Pharmingen), MHCII 
(clone M5/114, in house), CD40 (clone 3/23, BD Pharmingen), CD69 (clone H1.2F3, 
eBioscience), CD44 (clone IM7, in house), CD43 (clone S7, BD Pharmingen), 
CD28 (clone E18, Biolegend), TACI (clone 8F10, BD Pharmingen), CXCR4 (clone 
2B11, BD Pharmingen), CD1d (clone 1B1, in house) and Tim1 (clone RMT1-4, 
Biolegend). 

Validation of chimaeras. B’’ “and corresponding B“’” chimaeras were used as 
an example to demonstrate the validity of the chimaera system, that is, that the 
deficiency in the gene of interest is restricted to B cells, and complete in that com- 
partment in B* ” mice. To this end, B cells, CD11b* cells and CD4* T cells were 
purified from B®’? and corresponding B“” chimaeras. Briefly, splenocytes 
from B*3” and BY? mice were subjected to CD19 positive selection using 
magnetic cell sorting. The CD19+ fraction was subsequently stained for B220 
(clone RA3-6B2), CD11b (clone M1/70), CD11c (clone N418), CD4 (clone GK1.5), 
and CD8 (clone 53.6.72), and FACS-sorted for B220+CD11b−CD11c−CD4−CD8− 
cells, yielding >99% pure B cells. The CD19− fraction obtained from the mag- 
netic sort was stained, and used to FACS-sort CD4+B220−CD11b− cells, and 
CD11b+B220−CD4− cells, resulting in >97–98% pure CD4+ T cells and 
CD11b+ cells, respectively. All sorted populations as well as the remaining non- 
T/B/CD11b+ cells (denoted as non-T, B, M cells) were lysed and genomic DNA 
was extracted using the DNeasy Blood & Tissue Kit (Qiagen), according to the 
manufacturer’s instructions. The amounts of genomic DNA present in these sam- 
ples were equilibrated by real time PCR using the Mog gene as standard with the 
primer pair: forward 5'-AGGAAGGGACATGCAGCCGGA-3'; reverse 5’-CTGC 
ATAGCTGCATGACAACTG-3’. Wild-type Ebi3 gene was then amplified by 
conventional PCR with the following primers: forward 5'-AACCTCAGGCCA 
GGCAGT-3’; reverse 5'-TTCCGTAGGCCATGTAGGAC-3’, in order to test 
for the presence of the wild-type Ebi3 allele in the different cell fractions. 

Additional reagents. Antibodies used in this study also included anti-Ly6C (clone 
AL21, cat number 557359, BD Pharmingen), anti-IFN-γ (clone XMG1.2, BD 
Pharmingen), anti-CD154 (cat number 130-092-105, Miltenyi Biotech), anti-CD62L 
(MEL14, in house). 

Statistics. Statistical analysis was performed using GraphPad Prism (version 5.02 
for Windows, GraphPad Software, USA). EAE data distribution did not differ 
from normal distribution, as evaluated using Kolmogorov-Smirnov test. Equality 
of variances between groups was assessed before analyses by ANOVA or t-test. 
Groups were compared using ANOVA, two-tailed t-test, or Wilcoxon test, as indi- 
cated in figure legends. The t-tests were modified using Welch’s correction in case 
of unequal variance. One-way and two-way ANOVA were followed by Bonferroni 
post-test. No samples were excluded from analysis. Statistical analysis of the gene 
array data is described in the section ‘Gene array hybridization and data analysis’. 
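
For readers who want to see what the Welch correction and the Bonferroni adjustment mentioned above involve, a minimal Python sketch follows. The analyses in the study were run in GraphPad Prism; this code is only an illustration of the underlying formulas, and the function names are hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(group_a, group_b):
    """Welch's t statistic and Welch-Satterthwaite degrees of freedom.

    Unlike Student's t-test, the two groups are not assumed to share a
    common variance. Each group needs at least two observations.
    """
    var_a = variance(group_a) / len(group_a)  # squared standard error, group a
    var_b = variance(group_b) / len(group_b)  # squared standard error, group b
    t_stat = (mean(group_a) - mean(group_b)) / sqrt(var_a + var_b)
    dof = (var_a + var_b) ** 2 / (
        var_a ** 2 / (len(group_a) - 1) + var_b ** 2 / (len(group_b) - 1)
    )
    return t_stat, dof


def bonferroni(p_values):
    """Bonferroni-adjusted P values: multiply by the number of tests, cap at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]
```

The degrees of freedom returned by `welch_t` are generally non-integer and smaller than in the pooled-variance test, which is what makes the correction more conservative when variances differ.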


29. Biesen, R. et al. Sialic acid-binding Ig-like lectin 1 expression in inflammatory and 
resident monocytes is a potential biomarker for monitoring disease activity and 
success of therapy in systemic lupus erythematosus. Arthritis Rheum. 58, 
1136-1145 (2008). 

30. Saeed, A. I. et al. TM4: a free, open-source system for microarray data 
management and analysis. Biotechniques 34, 374-378 (2003). 

31. Hoehlig, K. et al. Activation of CD4+ Foxp3+ regulatory T cells proceeds normally 
in the absence of B cells during EAE. Eur. J. Immunol. 42, 1164-1173 (2012). 


©2014 Macmillan Publishers Limited. All rights reserved 


[Extended Data Figure 1, panels a to j: the graphics are not recoverable from the extracted text. Legible labels are listed here.] 
Panel a: x-axis, days post EAE induction. 
Panel b: x-axes, LPS, PGN or CpG (µg/ml) with RPMI control (top); LPS, PGN or CpG (2 µg/ml) + αCD40 (µg/ml) (bottom). 
Panel c: genes significantly regulated upon stimulation with LPS (n=9,703) or LPS + αCD40 (n=10,456), union n=12,933; genes differentially expressed between cells activated with LPS or LPS + αCD40 for 24h (n=1,459) or 72h (n=2,935), intersection n=436. Filtering on Gene Ontology IDs: Cytokine activity (GO:0005125), Chemokine activity (GO:0008009), Hormone activity (GO:0005179), Growth factor activity (GO:0008083). Resulting probe sets: 1418126_at (CCL5), 1422924_at (TNFsf9), 1449222_at (EBi3), 1450297_at (IL-6), 1417925_at (CCL22), 1450330_at (IL-10), 1420353_at (Lta). 
Panel d: p35, EBi3 and p40 in naive B cells and after 24h or 72h of LPS or LPS + αCD40. 
Panels e, f: fold change of p35 mRNA and EBi3 mRNA in naive B cells and after αCD40, LPS, PGN or CpG, alone or combined with αCD40. 
Panel j: p28 in C57BL/6 and p28 groups after RPMI, LPS, CpG, PGN and αCD40, alone or combined. 

Extended Data Figure 1 | Gene array analysis and IL-35 secretion by B cells 
activated in vitro. a, EAE was induced in B!”? ” (squares), pire’ (triangles) 
and B™” (circles) mice (see Fig. 1). Animals showing a body 

weight loss >20% and/or sustained front leg impairment were killed in 
accordance with ethical regulations. Survival curves were compared using 
Wilcoxon test. b, In the top panels, splenic B cells from C57BL/6 mice were 
stimulated with lipopolysaccharides (LPS), peptidoglycan (PGN) or CpG 
DNA oligonucleotides for 72h, as indicated, and IL-10 concentrations in 
supernatants were determined by Bio-Plex. Results from stimulated cells were 
compared to unstimulated B cells (RPMI) using one-way ANOVA followed by 
Bonferroni post-test (***P < 0.001; **P < 0.01; *P < 0.05; mean ± s.e.m.); 
in the bottom panels, splenic B cells were simultaneously activated with 
anti-CD40 (clone FGK-45; in indicated amounts) and LPS (2 µg ml−1), 
PGN (2 µg ml−1) or CpG (2 µg ml−1). Supernatants were collected after 72h 
and analysed for IL-10 content by Bio-Plex. Data shown are pooled from three 
independent experiments (mean ± s.e.m.). Results from B cells stimulated 
with LPS, CpG or PGN alone were compared to B cells co-stimulated with 
anti-CD40 using one-way ANOVA followed by Bonferroni post-test 
(***P < 0.001; **P < 0.01; *P < 0.05). c, Affymetrix MG 430 2.0 whole 
genome arrays were performed in quadruplicates for naive B cells, B cells 
activated by TLR4 for 24h and 72h, and B cells activated by TLR4 plus CD40 
for 24h and 72h (20 arrays in total). To obtain genes significantly regulated 
upon stimulation with LPS, the expression profiles of naive B cells, B cells 
activated with LPS for 24h, and B cells activated with LPS for 72h were 
compared to each other, generating three lists of differentially expressed 
Affymetrix IDs (referred to as genes in this figure), whose union provided a set 
of n = 9,703 Affymetrix IDs differentially expressed during LPS activation. 

A similar analysis yielded a set of n = 10,456 Affymetrix IDs differentially 
expressed during LPS plus anti-CD40 activation. The union of these two sets 
was computed to obtain the list of Affymetrix IDs modulated during B-cell 
stimulation with LPS or LPS plus anti-CD40 (n = 12,933). To focus on 
genes differentially expressed between B cells activated with LPS or LPS plus 
anti-CD40, the expression profiles of B cells activated with LPS or LPS plus 
anti-CD40 for 24h were compared to each other. A similar analysis was done 
for B cells activated for 72h. The intersection of these two sets provided the list 
of Affymetrix IDs differentially expressed between LPS-activated and LPS- 
activated plus anti-CD40-activated B cells at both 24h and 72h (this was 
done to identify the Affymetrix IDs with long-term differential expression) 
(n = 436). The intersection of the set n = 12,933 with the set n = 436 gave a list 
of 417 Affymetrix IDs, which was further filtered on Gene Ontology to extract 
the Affymetrix IDs corresponding to secreted factors, yielding a final list of 
7 genes. d, Signal intensities of p35, Ebi3 and p40 mRNA expression were 
calculated using the values of the Affymetrix arrays. Data show mean ± s.e.m. 
e, f, Splenic murine B cells were stimulated with LPS (1 µg ml−1), PGN 
(10 µg ml−1), CpG ODN 1826 (1 µg ml−1), and anti-CD40 (clone FGK-45; 
10 µg ml−1), alone or in combination, as indicated, for 24h or 72h. p35 (e) and 
Ebi3 (f) mRNA expression was then quantified by real-time PCR. Data compile 
three independent experiments (mean ± s.e.m.). For statistical analysis, one- 
way ANOVA test followed by Bonferroni post-test was performed comparing 
each activated B-cell sample to naive B cells (***P < 0.001; **P < 0.01; 
mean ± s.e.m.). g–i, Splenic murine B cells were stimulated for 72h with 
anti-IgM (F(ab’)2 goat anti-mouse IgM; Jackson ImmunoResearch; 
5 µg ml−1), LPS (1 µg ml−1) and L47 cells (5 × 10* cells, irradiated), alone or in 
combinations, as indicated. g, h, Levels of p35 and Ebi3 mRNA expression were 
quantified by real-time PCR. Data show compilation of three independent 
experiments (mean ± s.e.m.). i, Splenic murine B cells were activated as 
indicated for 72h, and treated with GolgiStop for the last 4h of culture. 
B-cell lysates were separated on SDS-PAGE gel and blotted with anti-EBI3 
antibody. Data show representative results from three independent 
experiments. j, Splenic B cells from C57BL/6 and p28−/− mice were stimulated with 
LPS (1 µg ml−1), CpG (1 µg ml−1), PGN (10 µg ml−1), anti-CD40 
(10 µg ml−1), alone or in combination, as indicated, for 72h. p28 
concentrations in culture supernatants were determined by ELISA. Data shown 
(mean ± s.e.m.) are pooled from four independent experiments. 
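
The list-combination logic described for panel c of this legend (a union of the probe sets regulated during activation, intersected with the probe sets that differ between the two stimulation conditions at both time points) can be expressed with plain set operations. The Python sketch below is illustrative only: the function and argument names are hypothetical, and the inputs stand in for the lists of differentially expressed Affymetrix IDs produced by the pairwise comparisons.

```python
def select_sustained_ids(lps_lists, lps_cd40_lists, diff_24h, diff_72h):
    """Combine lists of differentially expressed probe IDs.

    lps_lists:      ID lists from the pairwise comparisons within the
                    LPS time course (naive, 24h, 72h)
    lps_cd40_lists: the same for the LPS plus anti-CD40 time course
    diff_24h/72h:   IDs differing between LPS and LPS plus anti-CD40
                    at 24h and at 72h
    """
    regulated_lps = set().union(*lps_lists)
    regulated_lps_cd40 = set().union(*lps_cd40_lists)
    # IDs modulated during activation under either stimulation condition.
    modulated = regulated_lps | regulated_lps_cd40
    # IDs that differ between the two conditions at both time points.
    sustained = set(diff_24h) & set(diff_72h)
    return modulated & sustained
```

The returned set corresponds to the step that yields the 417 Affymetrix IDs in the legend; the subsequent Gene Ontology filter is a separate step not shown here.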
Extended Data Figure 2 | Characterization of immune responses during 
EAE in mice lacking IL-35 expression in B cells. a, EAE was induced in BP? ‘ 
(black diamonds, n = 5) and corresponding BY" mice (white triangles, n = 4). 
Data show clinical EAE scores (mean ± s.e.m.) from two independent 
experiments. Cumulative disease scores were compared using two-tailed 
unpaired t-test. b, EAE was induced in BP?’ (grey squares; n = 10) and 
corresponding B‘” mice (dark grey circles, n = 10). Splenocytes were collected 
from mice on day 28 after EAE induction, and re-stimulated individually for 
48h with MOG;;_55. Supernatants were analysed by ELISA to determine 
concentrations of IFN-y (middle) and IL-17 (right). Data (mean + s.e. m,) are 
pooled from two independent experiments. c, EAE was induced i in BP?” 
(triangles), Be (diamonds), pero” (squares), and B™” (circles) mice (see 
Fig. 2). Animals showing a body weight loss > 20% and/or sustained front leg 
impairment were killed in accordance with ethical regulations. Survival curves 
were compared using Wilcoxon test. d, Splenocytes were collected from B?** ‘ 
Be? and BT mice on day 28 after EAE induction (see Fig. 2a), and paclad 
before peer es for 48h with MOG3;_55. Culture supernatants were 
analysed by ELISA to determine concentrations of IFN-y (left) and IL-17 
(right). Data show representative results from two independent experiments. 
Statistical analysis was performed using the two-way-ANOVA test followed by 
a Bonferroni post-test (*P < 0.05, aes <0.01; pee < 0.001; mean + s.e.m.). 
Results of analysis are shown for B™" versus B??> ” comparison. e-j, EAE was 
induced in B’** “ (grey squares; n = 8) and corresponding B-WT mice (dark
grey circles; n = 8) and mice were killed at day 12 post-EAE induction to
analyse accumulation of immune cells in CNS. Data show clinical EAE scores
(mean ± s.e.m.) (e). f-h, Mononuclear cells were isolated from CNS (brain plus
spinal cord) of individual mice shown in (e), and numbers of total cells (f),
CD4+ T cells (g) and CD11b+Ly6G− macrophages (h) were quantified by flow
cytometry. Data shown (mean ± s.e.m.) are pooled from two independent
experiments. Results were compared using two-tailed unpaired t-test with
Welch's correction in case of unequal variances. i, j, Representative FACS
plots of CD4+ T cells and CD11b+Ly6G− macrophages, respectively.
k, Frequencies of Foxp3+ Treg cells among CD4+CD8− T cells in CNS at
day 12 post-EAE induction (left panel), and in spleen at day 10 post-EAE
induction (right panel) as determined by flow cytometry. Data shown in the left
panel (mean ± s.e.m.) are pooled from two independent experiments with
10 mice per group in total. Data shown in the right panel (mean ± s.e.m.) are
representative of three independent experiments with 9 mice per group in total.
Results were compared using two-tailed unpaired t-test. P values >0.05 are
indicated by ns. l, Treg suppression assay. CD4+CD25+ T cells (Treg)
were FACS-sorted from pooled spleen and lymph nodes of B-p35−/− and
corresponding B-WT mice on day 10 post-EAE induction, and then co-cultured
at the indicated ratios with 2 x 10 CD4+CD25− T cells (Teff) isolated from
naive C57BL/6 mice in presence of 1 X 10° irradiated splenocytes and
0.1 µg ml−1 anti-CD3 (clone 145-2C11). Proliferation was measured after 64 h
of culture by incorporation of [3H]thymidine, and plotted as percentage
suppression, with the proliferation values obtained for control cultures
without Treg cells defined as 100%. Data (mean ± s.e.m.) are pooled from two
independent experiments.
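
The normalization described for panel l can be sketched as a short calculation. This is only an illustration under stated assumptions: the legend defines proliferation without Treg cells as 100%, and the sketch assumes the common convention that suppression is 100 × (1 − proliferation with Treg / proliferation without Treg). The function name and the counts are hypothetical and are not the authors' data.

```python
def percent_suppression(cpm_with_treg: float, cpm_control: float) -> float:
    """Percentage suppression relative to control cultures without Treg cells.

    cpm_control is the thymidine incorporation of Teff cultured without Treg,
    which is taken as 100% proliferation (assumed convention).
    """
    if cpm_control <= 0:
        raise ValueError("control proliferation must be positive")
    return 100.0 * (1.0 - cpm_with_treg / cpm_control)


# Hypothetical counts per minute at decreasing Treg:Teff ratios.
control_cpm = 40000.0
example_cpm = {"1:1": 10000.0, "1:2": 20000.0, "1:4": 30000.0}
for ratio, cpm in example_cpm.items():
    print(ratio, percent_suppression(cpm, control_cpm))
```

With this convention a culture that proliferates as much as the control scores 0% suppression, and a culture with no incorporation scores 100%.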
Extended Data Figure 3 | Lack of IL-35 production by B cells results in
increased activation of B cells during EAE. a, EAE was induced in B-p35−/−
(n = 6) and corresponding B-WT mice (n = 6). Mice were killed on day 10 after
immunization and splenic B cells were analysed by flow cytometry for surface
expression of antigen-presenting molecules (MHC II), co-stimulatory
molecules (CD80, CD86), and activation markers (CD44, CD69). MFI, mean
fluorescence intensity. Data (mean ± s.e.m.) are pooled from two independent
experiments. Results were compared using two-tailed unpaired t-test with
Welch's correction in case of unequal variances. b, Representative histogram
plots showing expression of these molecules on wild-type and p35-deficient
B cells. Dead cells were excluded using propidium iodide. c, EAE was induced in
B-p35−/− and corresponding B-WT mice. B cells and CD4+CD25− T cells
(Teff) were isolated from pooled lymph nodes and spleens on day 10 after
immunization. To test the APC function of B cells (see Fig. 2), 5 X 10° B cells
from B-p35−/− or B-WT mice were then cultured with 1 X 10 Teff cells from B-p35−/−
or B-WT mice in presence of MOG35-55, as indicated. Culture supernatants were
collected after 48 h, and analysed by Bio-Plex to determine concentrations of
IFN-γ (left) and IL-6 (right). Data (mean ± s.e.m.) are pooled from two
independent experiments. Results were compared using two-way ANOVA
followed by Bonferroni post-test (*P < 0.05). Results of analysis are shown for
comparison, B-WT B cells plus B-WT T cells versus B-p35−/− B cells plus B-p35−/−
T cells for IL-6. Symbol key for panel c: B-WT B cells + B-WT Teff; B-WT B cells +
B-p35−/− Teff; B-p35−/− B cells + B-WT Teff; B-p35−/− B cells + B-p35−/− Teff.
Extended Data Figure 4 | Bacterial burden and T-cell activation in mice
lacking IL-35 production by B cells during Salmonella infection. a, B’*° i
BES P40” and corresponding B-WT mice were infected intravenously with
virulent Salmonella strain SL1344 (100 c.f.u.). Bacterial loads in spleens (top)
and livers (bottom) were determined on day 6 post-infection (p.i.). Data
(mean ± s.e.m.) are pooled from two independent experiments. Numbers
of mice analysed: BPs’ (n = 6) and corresponding B-WT mice (n = 8),
Bees’ (n = 5) and corresponding B-WT mice (n = 5), BP?” ’ (n = 6) and
corresponding control B-WT mice (n = 5). Data were analysed with two-tailed
unpaired t-test. P values > 0.05 are indicated by ns. b, Top shows liver
histochemistry of total infiltrating cells (haematoxylin and eosin; H&E),
T cells (CD3), and macrophages (F4/80) at day 6 p.i.; bottom, numbers of
inflammatory foci per liver section on day 6 p.i. Data (mean ± s.e.m.) are
pooled from two independent experiments. Numbers of mice analysed:
Bess” (n = 6) and corresponding B-WT mice (n = 8), Breis’ (n = 7) and
corresponding B-WT mice (n = 5), BP?? : (n = 7) and corresponding B-WT mice
(n = 8). Data were compared using two-tailed unpaired t-test with Welch's
correction in case of unequal variances. P values > 0.05 are indicated by ns.
c, BPs” (n = 6) and corresponding B-WT (n = 6) mice were infected
intravenously with attenuated Salmonella (strain SL7207). On day 21 p.i. bone
marrow (BM) cells were isolated from individual mice, and re-stimulated for
6 h with heat-killed Salmonella before staining for surface CD4, and
intracellular CD40L or IFN-γ. Top, representative FACS plots showing
CD40L and IFN-γ expression by CD4+ T cells, after gating on CD4+ cells.
Bottom, frequencies of CD40L+ (left) and IFN-γ+ (right) cells among CD4+
T cells in BM (mean ± s.e.m.). Data are pooled from two independent
experiments. Data were compared using two-tailed unpaired t-test. P values
> 0.05 are indicated by ns. d, Representative FACS plot shows Foxp3
expression by splenic CD4+ T cells from a B-WT mouse on day 21 p.i. with
attenuated Salmonella (strain SL7207); (top right) frequency of Foxp3+ Treg
cells among CD4+ T cells in spleens of naive B**” (n = 5) and corresponding
B-WT mice (n = 5), BP” ‘ (n = 7) and corresponding B-WT mice (n = 7);
(bottom) frequency of Foxp3+ Treg cells among CD4+ T cells on day 21 p.i.
with attenuated Salmonella (strain SL7207) in spleens of B’ aaa (n = 8) and
corresponding B-WT mice (n = 8), BP’? : (n = 10) and corresponding B-WT
mice (n = 10), BP” ‘ (n = 15) and corresponding B-WT mice (n = 9), BPS ‘
(n = 6) and corresponding B-WT mice (n = 6). Data (mean ± s.e.m.) are pooled
from two independent experiments. Data were analysed with two-tailed
unpaired t-test. P values >0.05 are indicated by ns. e, Mice were infected as in
(d) and frequencies of activated CD44hiCD62Llo CD4+ and CD8+ T cells
were determined by flow cytometry. Representative FACS plot shows
expression of CD44 and CD62L by live (PI negative) CD4+ T cells. Graphs
show frequencies of CD44hiCD62Llo cells among CD4+ T cells (left) and
CD8+ T cells (right). Data (mean ± s.e.m.) are pooled from two independent
experiments. Numbers of mice analysed: B?? (n = 14) and corresponding
B-WT mice (n = 12), Bees’ (n = 10) and corresponding B-WT mice (n = 10),
Be” (n = 15) and corresponding B-WT mice (n = 9), BP8 “ (n = 6) and
corresponding B-WT mice (n = 6).
Extended Data Figure 5 | Role of IL-35 and CD40 expression by B cells
during Salmonella infection. a, B’ ae , Bev ‘ , BP? “ and their
corresponding B-WT mice were infected with attenuated Salmonella (strain
SL7207). Relative titres of Salmonella-reactive IgM and IgG in serum were
determined by ELISA. Data (mean ± s.e.m.) are pooled from three
independent experiments. Numbers in brackets indicate the numbers of mice
analysed for each measurement. b, B??” ’ (n = 17) and corresponding
B-WT mice (n = 16) were immunized with 200 µg NP-OVA in alum
intraperitoneally. Relative titres of NP(15)-BSA-reactive antibodies in serum
were determined by ELISA. Data (mean ± s.e.m.) are pooled from three
independent experiments. Results were compared using two-tailed unpaired
t-test. P values >0.05 are indicated by ns. c, Mice were infected with 100 c.f.u. of
virulent Salmonella. Upper left shows survival curves of B“““ “ (n = 9) and
corresponding B-WT mice (n = 16). Data are pooled from two independent
experiments. Survival curves were compared using Wilcoxon test. Upper
right, numbers of inflammatory foci per liver section were determined on
day 5 p.i. Lower panel, bacterial loads in liver (left) and spleen (right)
were determined on day 5 p.i. Data (mean ± s.e.m.) are pooled from two
independent experiments. Numbers of mice analysed: B“”” “ (n = 12), and
corresponding B-WT mice (n = 13). Results were analysed using two-tailed
unpaired t-test with Welch's correction.
Extended Data Figure 6 | Characterization of IL-10- and IL-35-expressing
plasma cells during Salmonella infection. a, b, C57BL/6 mice were infected
intravenously with attenuated Salmonella (strain SL7207; 10^7 c.f.u.). On
indicated days p.i., CD138hi plasma cells and CD19+CD138− B cells were
isolated from spleen through magnetic and FACS procedures. a, Strategy for
isolation of CD138hi plasma cells and CD19+CD138− B cells: splenocytes were
stained with anti-CD138-PE followed by labelling with magnetic anti-PE
microbeads, and magnetic separation on autoMACS (Miltenyi Biotech). The
obtained CD138-positive and CD138-negative fractions were stained for CD19,
CD138 and CD11b/CD11c/TCR-β/DAPI. CD138hiCD11b−CD11c−TCR-β−DAPI−
and CD19+CD138−CD11b−CD11c−TCR-β−DAPI− cells were
then isolated from the CD138-positive and CD138-negative fractions, respectively,
by FACS, as bulk or single cells. Numbers in FACS plots indicate the
percentages of CD19+CD138− B cells and CD138hi plasma cells in total
splenocytes on day 3 after challenge (left plots), after magnetic isolation (middle
plots), and after FACS isolation (right plots). b, Single CD138hi cells were
isolated on indicated days, and analysed by single-cell PCR using a mix of
primers for β-actin, Il10, p35 and Ebi3. Only cells giving a positive signal for
β-actin were included in the calculations shown. Data show frequencies of
CD138hi cells that gave a positive signal for Il10, both p35 and Ebi3, or Il10 and
p35 and Ebi3. Numbers of cells tested: day 1 (181 cells; 86% positive for β-actin),
day 3 (156 cells; 79% positive for β-actin) and day 8 (163 cells; 97% positive for
β-actin). Cells were obtained from two independent experiments. c, d, CD138hi
plasma cells and CD19+CD138− B cells were enriched by magnetic isolation
(as described in a) from spleen of C57BL/6 mice on day 3 after infection with
attenuated Salmonella (SL7207; 10^7 c.f.u.). c, Data show expression levels of
selected surface receptors on live (DAPI−) CD138hi plasma cells and
CD19+CD138− B cells. Cells were gated as in a (middle panels), and analysed
by flow cytometry. d, Levels of CD1d and Tim-1 expressed by CD138hi cells
were compared to those found respectively on CD1dhi and CD1dlo B cells, and
on Tim1+ and Tim1− B cells from spleen of naive C57BL/6 mice. Data show
representative results from 2-5 independent experiments.
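
The single-cell PCR denominators in panel b can be approximated from the numbers given in the legend. The sketch below is a hedged illustration: the legend reports only the cells tested and a rounded percentage positive for β-actin, so the counts it derives are estimates, and the helper names are hypothetical. Frequencies for a given transcript would then be expressed relative to these β-actin-positive cells, as the legend states.

```python
# Cells tested and percentage positive for beta-actin, as listed in the legend.
tested = {"day 1": (181, 86), "day 3": (156, 79), "day 8": (163, 97)}


def approx_positive(n_tested: int, percent_positive: float) -> int:
    """Estimate the number of beta-actin-positive cells from a rounded percentage."""
    return round(n_tested * percent_positive / 100)


def frequency(n_hits: int, n_actin_positive: int) -> float:
    """Percentage of beta-actin-positive cells that scored positive for a transcript."""
    return 100.0 * n_hits / n_actin_positive


for day, (n_tested, pct) in tested.items():
    print(day, approx_positive(n_tested, pct))
```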
Extended Data Figure 7 | Characterization of plasma cells during
Salmonella infection. a, CD138+ plasma cells and CD19+CD138− B cells
were magnetically enriched from spleens of C57BL/6 mice on day 3 after
infection with attenuated Salmonella (SL7207; 10^7 c.f.u.). CD19+CD138−
B cells (P1) were then isolated from the CD138-negative fraction by FACS.
CD138intCD22+ (P2), CD138hiCD22+ (P3), and CD138hiCD22− (P4) cells were
isolated from the CD138-positive fraction by FACS. b, Frequencies of IgM- and
IgG-secreting cells among B cells (CD19+CD138−) and each of these plasma
cell subsets (CD138intCD22+, CD138hiCD22+, CD138hiCD22−) were
determined by ELISPOT assay. A compilation of two independent experiments
is shown (mean ± s.e.m.). c, Blimp1, Irf4, Pax5, Il10, Ebi3 and p35 mRNA
expression was quantified by real-time PCR in these populations. Naive splenic
B cells (naive CD19+) were isolated from unchallenged C57BL/6 mice by
magnetic selection. Data show the compilation of three independent
experiments (mean ± s.e.m.). d, Plasma cells accumulate in splenic red pulp
aggregates adjacent to white pulp areas on day 3 after Salmonella infection.
C57BL/6 mice were infected intravenously with attenuated Salmonella
(SL7207; 10^7 c.f.u.), and spleens were harvested 3 days later. Spleen sections
(7 µm) were stained with: top panels: anti-Ig-κ (clone 187.1; green), anti-Ig-λ
(clone SL136; green), anti-CD90 (clone T24; red) and anti-CD22 (clone OX-97;
blue); middle panels: anti-Ig-κ (clone 187.1; red), anti-Ig-λ (clone SL136; red),
anti-CD11b (clone M1/70; green), and anti-CD22 (clone OX-97; blue); bottom
panels: anti-Ig-κ (clone 187.1; red), anti-Ig-λ (clone SL136; red), anti-CD169
(clone MOMA-1; green), and anti-CD22 (clone OX-97; blue).
Extended Data Figure 8 | Production of IL-10 and IL-6 by plasma cells
and B cells after Salmonella infection. a, CD138hi plasma cells and
CD19+CD138− B cells were isolated from spleen of C57BL/6 mice on day 3
after infection with attenuated Salmonella (SL7207; 10^7 c.f.u.), as described in
Extended Data Fig. 6. Isolated cells were activated for 24 h as indicated with LPS
(1 µg ml−1), anti-CD40 (10 µg ml−1), and anti-IgM (5 µg ml−1). IL-10 (left)
and IL-6 (right) concentrations were determined by Bio-Plex. Data shown are
pooled from 2-5 independent experiments (mean ± s.e.m.). Results were
compared using two-tailed unpaired t-test. P values > 0.05 are indicated by ns.
b, c, CD138hi plasma cells were isolated from spleen of C57BL/6 or IL-10-eGFP
mice on day 3 after infection with attenuated Salmonella (SL7207; 10^7 c.f.u.).
b, Cells were analysed by flow cytometry directly after isolation to quantify
IL-10-eGFP expression (ex vivo), and 24 h after re-stimulation with PMA plus
ionomycin. Data show representative FACS plots and histogram overlays.
Numbers in the plots indicate the percentages of IL-10-eGFP-positive cells
among total live (PI−) cells. c, Cells isolated from C57BL/6 and IL-10-eGFP
mice were re-stimulated for 24 h, as indicated with LPS (1 µg ml−1), anti-CD40
(10 µg ml−1), anti-IgM (5 µg ml−1), IL-4 (20 ng ml−1), IL-5 (20 ng ml−1), IL-6
(20 ng ml−1) and IL-21 (20 ng ml−1). Cells were analysed as in b. Data shown
are pooled from 2-4 independent experiments (mean ± s.e.m.). Results were
compared between condition RPMI and activated conditions (left) or between
condition with added cytokine and condition LPS plus anti-CD40 (right) using
one-way ANOVA followed by Bonferroni post-test (*P < 0.05, **P < 0.01;
***P < 0.001). d-g, Expression of IL-6 by IL-10-expressing CD1dhi B cells.
d, CD19+CD1dhi and CD19+CD1dlo cells were isolated from spleen of naive
C57BL/6 mice according to CD19 and CD1d expression using magnetic and
FACS procedures, and stimulated for 24 h, as indicated. IL-6 and IL-10
concentrations in culture supernatants were determined by Bio-Plex.
e, f, CD1dhi B cells were isolated from naive IL-10-eGFP mice and stimulated
with LPS (2 µg ml−1) for 48 h. IL-6 and IL-10 concentrations in culture
supernatants were determined by Bio-Plex (e), and cells were analysed by flow
cytometry to determine the frequency of IL-10-eGFP-expressing cells (f).
g, CD1dhi B cells were isolated from naive IL-10-eGFP mice, and stimulated
with LPS (2 µg ml−1) for 48 h. IL-10-eGFP+ and IL-10-eGFP− cells were then
separated by FACS and analysed by real-time PCR for expression levels of Il10
and Il6 mRNA. Data shown are pooled from two independent experiments
(mean ± s.e.m.).
Extended Data Figure 9 | IL-35 and IL-10 production by plasma cells during
EAE and Salmonella infection. a, B cells and plasma cells were isolated from
spleens of mice on days 14 and 28 after EAE induction through magnetic
and FACS procedures. Splenocytes were stained with anti-CD138-PE followed
by labelling with anti-PE microbeads, and magnetic separation. The obtained
CD138-positive and CD138-negative fractions were stained for CD19, CD138,
and CD11b/CD11c/CD90/DAPI. CD138+CD11b−CD11c−CD90−DAPI−
and CD19+CD138−CD11b−CD11c−CD90−DAPI− cells were then isolated
from CD138-positive and CD138-negative fractions, respectively, by FACS, as
bulk cells. Naive splenic B cells (naive CD19+) were isolated from unchallenged
C57BL/6 mice by magnetic selection. Blimp1, Irf4 and Pax5 mRNA expression
was quantified by real-time PCR. Data compile 2 independent experiments
(mean ± s.e.m.). b, Production of IgM and IgG by plasma cells and B cells
isolated from mice during EAE was determined by ELISPOT. A compilation of
3 independent experiments for day 14, and 2 independent experiments for day
28 is shown (mean ± s.e.m.). c, Left, CD19+CD138− B cells (B cell) and
CD138+ plasma cells (plasma cell) were isolated from spleens of mice on days
14 and 28 after EAE induction through magnetic and FACS procedures, as
described in a, and stimulated for 24 h with PMA plus ionomycin (PMA plus
iono) or kept in medium (Ø). IL-10 concentrations were determined by
Bio-Plex. Data are pooled from 4 independent experiments for day 14 and 2
independent experiments for day 28 (mean ± s.e.m.). Right, B cells
(CD19+CD138−) and CD138+ plasma cells (CD138+) were isolated from
spleens of mice on day 14 after EAE induction through magnetic isolation. For
this, splenocytes were stained with anti-CD138-PE followed by labelling with
anti-PE microbeads, and magnetic separation to yield CD138+ plasma cells.
B cells were obtained from the CD138-negative fraction after further depletion
of remaining CD138+ cells, and positive selection using anti-CD19 magnetic
microbeads. Purities were above 95%. Cell lysates were separated on SDS-PAGE
gel and sequentially blotted with anti-EBI3, anti-p35, and anti-actin
antibodies. Data show representative results of two independent experiments.
d, e, Mice in which B cells cannot co-express IL-10 and IL-35 do not develop
exacerbated EAE. d, Left, B-p35−/−(50%) IL-10−/−(50%) mice were obtained by reconstituting
lethally irradiated recipient mice with BM cells from JHT mice (80%), IL-10-deficient
mice (10%), and p35-deficient mice (10%). B-WT mice were produced
using C57BL/6 BM instead of JHT BM cells. BP?" and B™?” mice were
obtained as outlined in Extended Data Fig. 10. Right, EAE was induced in
B-p35−/−(50%) IL-10−/−(50%) mice (grey diamonds, n = 10), B? Bort (grey squares,
n = 10), Bie: (white diamonds, n = 5) and B-WT (dark grey circles, n = 10)
mice. Data show clinical EAE scores. e, Splenocytes were harvested from
B-p35−/−(50%) IL-10−/−(50%) and B-WT mice on day 28 after EAE induction, and
re-stimulated individually for 48 h with MOG35-55. Culture supernatants were
analysed by ELISA to determine concentrations of IFN-γ and IL-17. Data
(mean ± s.e.m.) are pooled from two independent experiments. f, Production
of IL-10 by IL-35-deficient B cells in vitro. Splenic B cells from naive C57BL/6,
p35−/−, and p35−/−Ebi3−/− mice were stimulated with LPS (1 µg ml−1),
anti-CD40 (10 µg ml−1), alone or in combination, as indicated, for 72 h, and
IL-10 concentrations were determined by Bio-Plex. Data (mean ± s.e.m.) are
pooled from three independent experiments. Results were compared between
C57BL/6 cells and cells from mutant mice using one-way ANOVA followed
by Bonferroni post-test (*P < 0.05). g-i, IL-35 is not necessary for production
of IL-10 by CD138hi plasma cells during Salmonella infection. CD138hi plasma
cells were isolated from spleen of C57BL/6, IL-10-eGFP, p35−/−, and p35−/−
IL-10-eGFP mice on day 3 after infection with attenuated Salmonella
(10^7 c.f.u.) using a combination of magnetic and FACS methods. Cells were
analysed by flow cytometry (g) to determine the frequency of IL-10-eGFP-expressing
cells after re-stimulation as indicated for 24 h (h). Numbers in the
plots indicate the percentages of IL-10-eGFP-positive cells among total live
(PI−) cells. Results were compared using two-tailed unpaired t-test. i, CD138hi
plasma cells isolated from C57BL/6 and p35−/− mice were re-stimulated for
24 h as indicated. IL-10 concentrations were determined by Bio-Plex. Data are
pooled from two independent experiments (mean ± s.e.m.). Results were
compared using two-tailed unpaired t-test with Welch's correction. P values
>0.05 are indicated by ns.
Extended Data Figure 10 | Generation and validation of bone marrow
chimaera. a, Scheme for the generation of B-X−/− and corresponding control
B-WT mice. Left, B-X−/− mice with a deficiency in gene X restricted to B cells were
obtained by reconstituting lethally irradiated recipient mice with a mixture
of BM cells from B-cell-deficient JHT mice (80%) and from gene X-deficient
(X−/−) mice (20%). The table indicates the contribution of JHT and X−/− BM
cells to relevant haematopoietic cell types in the resulting chimaera, taking into
account that JHT mice carry wild-type (WT) alleles of gene X. In B-X−/− mice,
B cells only developed from X−/− BM cells and therefore were all X−/−. In
contrast, all other haematopoietic lineages developed from both X−/− and JHT
BM cells. As the JHT and X−/− BM cells were injected in a ratio 80:20, most
haematopoietic cells other than B cells carried a wild-type genotype for X in
B-X−/− mice. Right, the corresponding control B-WT mice were obtained by
reconstituting lethally irradiated recipient mice with a mixture of BM cells from
WT C57BL/6 mice (80%) and from X−/− mice (20%). The table indicates the
contribution of WT and X−/− BM cells to relevant haematopoietic cell types in
the resulting chimaera. In B-WT mice, all haematopoietic lineages, including
B cells, developed from WT (80%) and X−/− (20%) BM cells. Therefore B cells
mostly had a wild-type genotype in B-WT mice, which also controlled for
possible effects owing to the 20% X−/− haematopoietic cells other than B cells
present in B-X−/− mice (for example, 20% X−/− T cells, and so on). b, Top,
representative FACS plots of isolated B cells, CD4+ T cells and CD11b+ cells
purified from naive B-Ebi3−/− and B-WT mice, using magnetic and FACS-based
methodologies; (bottom) genomic DNA was extracted from the sorted cell
populations, including the non-labelled B220−CD4−CD11b− cells (indicated
as Non-T,B,M). Genomic DNA samples from tail biopsies of C57BL/6 and
Ebi3−/− mice were used as controls. The amounts of genomic DNA present
in each sample were equilibrated using the Mog gene as standard. Using
equilibrated samples, the wild-type Ebi3 allele was detected by PCR in all DNA
preparations except for B cells from B-Ebi3−/− mice, and the tail biopsy from
Ebi3−/− mice. c, Reconstitution of the B-cell compartment in BM chimaeric
mice. B?>> ’ , Bee a , BP? “ and corresponding B-WT controls were killed after
reconstitution. Data show the percentage of CD19+ B cells in spleen,
as determined by flow cytometry. Data are pooled from 2-3 independent
experiments. Data (mean ± s.e.m.) include B?? "(n= cu and corresponding
B-WT mice (n = 7), Breis” (n = 7) and corresponding B-WT mice (n = 9),
peo’ (n = 7) and corresponding B-WT mice (n = 7). Comparisons of B-cell
reconstitutions between B-X−/− and corresponding B-WT mice were done with
two-tailed unpaired t-test with Welch's correction (P values > 0.05 are
indicated as ns). d, Representative FACS plots of B cells from B-WT and
BP>" mice. Cells are gated on live (PI−) spleen cells.
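
The genotype bookkeeping behind the mixed bone marrow chimaeras in panel a can be sketched as a small calculation. This is a minimal illustration, assuming each donor marrow contributes to a lineage in proportion to its share of the graft and that JHT marrow contributes no B cells; the function name and genotype labels are hypothetical. The same arithmetic reproduces the 50%/50% B-cell split described for the mixed chimaeras in Extended Data Fig. 9d.

```python
from fractions import Fraction


def lineage_genotypes(donors, lineage):
    """Expected genotype fractions within one haematopoietic lineage.

    donors: list of (genotype, graft_fraction, can_make_b_cells) tuples.
    Assumes proportional contribution of each donor marrow (a simplification).
    """
    # Donors that cannot make B cells (JHT) drop out of the B lineage only.
    contributing = [(g, f) for g, f, makes_b in donors if lineage != "B" or makes_b]
    total = sum(f for _, f in contributing)
    fractions = {}
    for genotype, share in contributing:
        fractions[genotype] = fractions.get(genotype, Fraction(0)) + share / total
    return fractions


# B-X-/- chimaera: 80% JHT (wild type for X, no B cells) + 20% X-/-.
b_x_ko = [("WT", Fraction(80, 100), False), ("X-/-", Fraction(20, 100), True)]
# B-WT control: 80% wild-type C57BL/6 + 20% X-/-.
b_wt = [("WT", Fraction(80, 100), True), ("X-/-", Fraction(20, 100), True)]

for name, donors in (("B-X-/-", b_x_ko), ("B-WT", b_wt)):
    for lineage in ("B", "T"):
        print(name, lineage, lineage_genotypes(donors, lineage))
```

Under these assumptions B cells in the B-X−/− chimaera are entirely X−/−, while T cells are 80% wild type in both chimaeras, which is the control logic the legend describes.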
Obesity-associated variants within FTO form
long-range functional connections with IRX3
Nancy J. Cox'*, Chi-Chung Hui’, Jose Luis Gomez-Skarmeta* & Marcelo A. Nobrega'


Genome-wide association studies (GWAS) have reproducibly assoc- 
iated variants within introns of FTO with increased risk for obesity 
and type 2 diabetes (T2D)'*. Although the molecular mechanisms 
linking these noncoding variants with obesity are not immediately 
obvious, subsequent studies in mice demonstrated that FTO expres- 
sion levels influence body mass and composition phenotypes**. How- 
ever, no direct connection between the obesity-associated variants 
and FTO expression or function has been made’’. Here we show 
that the obesity-associated noncoding sequences within FTO are 
functionally connected, at megabase distances, with the homeobox 
gene IRX3. The obesity-associated FTO region directly interacts with 
the promoters of IRX3 as well as FTO in the human, mouse and 
zebrafish genomes. Furthermore, long-range enhancers within this 
region recapitulate aspects of IRX3 expression, suggesting that the 
obesity-associated interval belongs to the regulatory landscape of 
IRX3. Consistent with this, obesity-associated single nucleotide poly- 
morphisms are associated with expression of IRX3, but not FTO, in 
human brains. A direct link between IRX3 expression and regula- 
tion of body mass and composition is demonstrated by a reduction 
in body weight of 25 to 30% in Irx3-deficient mice, primarily through 
the loss of fat mass and increase in basal metabolic rate with browning 
of white adipose tissue. Finally, hypothalamic expression of a dominant- 
negative form of Irx3 reproduces the metabolic phenotypes of Irx3- 
deficient mice. Our data suggest that IRX3 is a functional long-range 
target of obesity-associated variants within FTO and represents a 
novel determinant of body mass and composition. 

Noncoding variation in single nucleotide polymorphisms (SNPs) 
within a 47-kilobase (kb) region of high linkage disequilibrium in introns 
1 and 2 of FTO remains the strongest genetic association with risk to 
polygenic obesity in humans'~. Individuals homozygous for risk alleles 
of the associated SNPs weigh approximately 3 kg more than individuals 
homozygous for non-risk alleles, underscoring the significant pheno- 
typic impact of these common variants”. Multiple follow-up studies directly 
implicated FTO as a gene controlling body mass and composition. Null 
Fto alleles in mice result in lean phenotypes, with reciprocal models 
overexpressing FTO displaying increased body weight*°. However, 
there is no direct evidence that enhancers within the obesity-associated 
FTO introns are connected with regulation of FTO expression. Expression
quantitative trait locus (eQTL) analyses have systematically failed
to show association between the obesity-associated SNPs and FTO expression
in human tissues’ ’.

To chart directly the cis-regulatory circuitry within the FTO locus, 
we used circular chromosome conformation capture followed by high- 
throughput sequencing (4C-seq). We carried out 4C-seq in whole mouse 
embryos (embryonic day 9.5; E9.5) and in adult (8 weeks) mouse 
brains, as previous work suggests that brain FTO expression modulates
metabolic parameters*’®. We profiled the genomic interactions with
promoters of genes located within a 1-megabase (Mb) window around 
the obesity-associated SNPs (mouse genome build mm9, chromosome 
8: 93,725,000-94,725,000), including Fto and Rpgrip1l, and Irx3, a half-megabase
downstream. There was a clear difference in interaction patterns
among genes (Fig. 1a). The Fto promoter chiefly participates in
genomic interactions proximal to the gene promoter. Although in mouse 
embryos these interactions include the obesity-associated intronic region 
(Fig. 1b), no such interactions are detected in adult mouse brains (Ex- 
tended Data Fig. 1 and Supplementary Table 1). In marked contrast, 
the promoter of Irx3 participates in numerous long-range interactions 
across a broad genomic region encompassing nearly 2 Mb, including 
robust interactions with the obesity-associated interval within FTO 
(Fig. 1a). We confirmed these interactions between Irx3 and the Fto 
obesity-associated region using chromatin conformation capture (3C) 
in adult mouse brains (Extended Data Fig. 2c). These data suggest that 
the obesity-associated interval is likely to be part of the regulatory 
landscape of Irx3. We next inferred that the long-range interactions 
between the obesity-associated FTO intron and IRX3 represent a con- 
served feature in vertebrate genomes. Inspecting the human ENCODE 
data set, we observe that human breast cancer MCF-7 cells display the 
same pattern of long-range looping between the obesity-associated inter- 
val and IRX3, but not FTO, assayed by chromatin interaction analysis 
with paired-end tag sequencing (ChIA-PET) (Extended Data Fig. 2a). 
This is corroborated further by recent Hi-C (high-throughput 3C) data 
in human fibroblasts’? (Extended Data Fig. 2b). We also observed a 
similar pattern of chromatin interactions in zebrafish embryos, assayed 
by 4C-seq (Extended Data Fig. 1c). Together, these data suggest that 
the obesity-associated FTO intron is mediating functional interactions 
with IRX3 in the human, mouse and zebrafish genomes.
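
As a small worked check of the profiling window quoted above, the following sketch parses the mm9 coordinates and confirms that they span 1 Mb. It is only an illustration: the `chr8` shorthand and the helper name are assumptions, and the midpoint it prints is simply the centre of the window, not a stated SNP position.

```python
def parse_interval(text: str) -> tuple[str, int, int]:
    """Parse 'chrom:start-end' with thousands separators into (chrom, start, end)."""
    chrom, span = text.split(":")
    start_text, end_text = span.split("-")
    start = int(start_text.replace(",", "").strip())
    end = int(end_text.replace(",", "").strip())
    return chrom.strip(), start, end


# Window quoted in the text (mouse genome build mm9, chromosome 8).
chrom, start, end = parse_interval("chr8:93,725,000-94,725,000")
width = end - start
midpoint = start + width // 2
print(chrom, width, midpoint)
```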

After revealing evidence of direct looping between FTO intronic regions 
and IRX3, we next showed that enhancers within the obesity-associated 
interval possess functional characteristics compatible with IRX3 expression. 
We observed that the degree of evolutionary conservation in noncoding
sequences of the FTO-IRX locus (chromosome 16: 53,731,249-
54,975,288) places it in the top 2% of similarly sized genomic intervals,
suggesting the presence of many functional noncoding elements spread 
over the region (see Methods). More specifically, ENCODE data suggest 
that the 47-kb obesity-associated interval is fraught with cis-regulatory 
elements, evidenced by an abundance of enhancer-associated chromatin 
marks, DNase hypersensitive sites, and transcription-factor binding 
events across this genomic region (Fig. 1b). We tested three human DNA 
fragments from the 47-kb obesity-associated region for their putative 
enhancer properties using an in vivo mouse reporter assay. These frag- 
ments overlap enhancer-like chromatin marks in multiple cell lines. All 
three fragments displayed enhancer activity in neonatal (postnatal day 
Figure 1 | Long-range interactions in the IRX3-FTO locus. a, Mouse
embryo 4C-seq interactions emanating from each promoter are displayed as 
links across the circle (darker link implies greater significance). Outer plots 
show significance of interactions above background (—log(P value)). The 
obesity-associated interval is highlighted red. b, Magnified view of the 
association interval. Contained are the orthologous locations of obesity- 
associated SNPs (black pips), and epigenetic marks associated with regulatory 
elements. Endogenous Irx3 expression is shown in lung and brain in Irx3"*“* 
knock-in mouse (top two images). Three enhancers drive reporter expression 
in lungs and brain (rectangles in the Enhancers row, corresponding to 
bottom three images). Other enhancers’’ are shown in black. HS, DNase 
hypersensitive sites. 


0 (P0)-P1) mice. Interestingly, two of the fragments drove strong reporter 
gene expression in lungs (Fig. 1b). Irx3 is highly expressed during lung 
development (Fig. 1b)'*'°, whereas Fto expression in lungs is limited 
(Extended Data Fig. 3a and refs 2, 14-16). Consistent with our data, a 
previous report has shown that the FTO obesity-associated region 
contains enhancers that drive Irx3-like expression in zebrafish”. 

Our data suggest that the regulatory landscape of IRX3 spreads over 
megabase distances, whereas FTO expression is primarily regulated by 
regions proximal to its promoter. To test this, we engineered a human 
bacterial artificial chromosome (BAC) spanning 162 kb of the FTO 
locus, including its promoter and the 47-kb obesity-associated region 
(Extended Data Fig. 3). We recombineered a reporter cassette at the 
FTO translation start, and generated transgenic mice harbouring the 
engineered BAC. Transgenic mice expressed the reporter gene in multiple 
tissues, recapitulating the endogenous expression pattern of FTO. 
We next determined that a 1.2-kb region corresponding to the FTO 
promoter is sufficient to recapitulate most of the FTO expression pat- 
tern (Extended Data Fig. 3). In contrast, a 2.8-kb region corresponding 
to the IRX3 promoter does not recapitulate any of the endogenous 
expression patterns of IRX3, suggesting that IRX3 expression relies 
on long-range cis-regulatory elements. A total of 15 human sequences 
in the broader FTO-IRX locus have been characterized as in vivo enhancers 
in mouse, driving reporter expression in Irx3-expressing tissues, 
including eye, limb, brain, neural tube and branchial arches 
(Supplementary Table 2). Although these enhancers lie outside the obe- 
sity association interval, they are consistent with our chromatin looping 
data pointing to a broad Irx3 regulatory landscape, extending into Fto. 
Further supporting these findings, the genomic interval spanning FTO 
and IRX3 has been proposed to be part of a single genomic regulatory 
block, based on the presence of extended synteny across deep phylogenies 
and a high density of highly conserved noncoding elements, as 
well as patterns of CTCF binding and chromatin interactions. Under 
these models, our data support the idea that the broad expression patterns 
of FTO are primarily regulated by elements proximal to the promoter, 
and that IRX3 is endowed with an ancient, extensive cis-regulatory circuitry 
extending into FTO. 

Figure 2 | BMI-associated SNPs are associated with expression of IRX3, but 
not FTO, in human brain. a, SNPs associated with BMI in the FTO locus. 
Forty-three SNPs associated with IRX3 expression (eSNPs) are shown in red, 
including 11 also associated with BMI. No SNPs are associated with FTO 
expression (P > 0.05). Thick arrowheads, IRX3 eSNPs outside the obesity-
associated region (pink); thick arrow, rs9930506. b, In cerebellum, the allele of 
rs9930506 associated with increased BMI (risk allele) is correlated with 
increased IRX3 expression and not with FTO expression. Vertical middle bars 
represent the first and third quartiles, horizontal lines represent the median, 
and open circles denote outliers beyond 1.5 times the interquartile range. 
c, Histogram plotting the number of SNPs associated with IRX3 expression in 
adipose (black) and brain (red) (y axis) by the significance of their association 
with BMI (x axis). Full details of statistics are provided in the Methods. 

We next determined that gene expression studies in human brains 
corroborate our chromatin looping data, showing that the obesity-
associated SNPs are associated with expression levels of IRX3, but not 
FTO. Our data and those of others (Fig. 1b) demonstrate that the obesity-
associated SNPs overlap several in vivo enhancers, raising the possibility 
that allelic variants may disrupt enhancer activity and alter expression 
of their target gene(s). To test this, we carried out eQTL mapping in 
human brain samples. FTO and IRX3 are highly expressed in multiple 
regions of the brain including cerebellum and hypothalamus (Extended 
Data Fig. 4). Using a data set of 153 brain samples from individuals of 
European ancestry, represented by cerebellum, we found significant 
association between 11 SNPs previously associated with increased body 
mass index (BMI), and expression of IRX3, but not FTO (Fig. 2a and Sup- 
plementary Table 3). These common SNPs have minor allele frequencies 
ranging from 42.5 to 47.5% and are in strong linkage disequilibrium with 
each other (Extended Data Fig. 5). In all cases, such as for SNP rs9930506 
(Fig. 2b), the allele associated with increased BMI is also associated with 
increased IRX3 expression. Further analysis, using the GIANT Consortium 
data set of SNPs associated with BMI in 249,796 individuals, 
reveals that among SNPs significantly associated with IRX3 expression 
in brain or adipose tissue, only those associated with IRX3 in brain show 
highly significant associations with BMI (Fig. 2c). Taken together, our 
data directly tie the noncoding genetic variation within FTO to tissue- 
specific modulation of IRX3, but not FTO, expression in human brain. 

Next, we exploited animal models to determine a potential role for 
IRX3 expression in the regulation of BMI and/or metabolism. Mice homozygous 
for an Irx3-null allele (Irx3-knockout mice) are viable and fertile, 
with no evidence of embryonic lethality. We observed a 25-30% reduction 
of body weight in Irx3-knockout mice compared to control littermates 
(wild type), independent of gender (Fig. 3a and Extended Data 
Figs 6a, b and 7a, b). This difference became more pronounced when 
animals were subjected to a high-fat diet (HFD), with Irx3-knockout 
animals showing no significant body weight gain, in contrast to a 
63% increase in control animals (Fig. 3a, b). Importantly, the percent- 
age of fat mass in Irx3-knockout mice was significantly reduced without 
marked change of the lean mass ratio (Fig. 3c, d and Extended Data 
Fig. 7c, d). Irx3-knockout mice exhibit marked reduction in adiposity 
with smaller fat depots as well as reduced adipocyte size (Fig. 3e and 
Extended Data Figs 6b-d and 7e, f). These results were confirmed by 
differential gene expression of adiposity markers (that is, leptin, 
adiponectin and mcp1) in the perigonadal white adipose tissue (PWAT) of 
Irx3-knockout mice (Extended Data Fig. 6e). Importantly, Fto express- 
ion was not altered in the hypothalamus or PWAT of Irx3-knockout 
mice, suggesting that the lean phenotype of Irx3-knockout mice is not 
associated with Fto (Fig. 3f). As Irx3-knockout mice were resistant to 
HFD-induced obesity and metabolic disorder, such as hepatosteatosis 
(Fig. 3e), we examined glucose homeostasis by performing glucose 
tolerance tests (GTT) and insulin tolerance tests (ITT) at different time 
points in the presence or absence of HFD. In 8-week-old mice, no 
difference in GTT was found (Fig. 3g). Although ageing and HFD led 
to glucose intolerance and insulin resistance in wild-type mice, Irx3-
knockout mice showed none of these metabolic phenotypes (Fig. 3g-j). 

Figure 3 | Irx3-deficient mice are leaner and are protected against diet-
induced obesity. a, Body weight in wild-type (WT) and Irx3-knockout (Irx3 
KO) mice fed a normal diet (ND) or high-fat diet (HFD). BL, body length. 
b, Weight gain in ND or HFD. c, Fat and lean mass in WT and Irx3 KO mice. 
d, Fat and lean mass ratio as a percentage of body weight. e, Sections of inguinal 
white adipose tissue (IWAT; subcutaneous), perigonadal WAT (PWAT; 
visceral), brown adipose tissue (BAT), and liver from HFD mice. f, Fto 
messenger RNA expression in hypothalamus and PWAT of Irx3 KO and WT 
mice. g-i, Glucose tolerance tests (GTT) in WT and Irx3 KO mice. Inset graphs 
show area under curve (AUC). j, Insulin tolerance test (ITT) in HFD mice. 
k, Energy expenditure of ND and HFD mice. l, Gene expression in PWAT. 
m, Irx3 expression in the arcuate nucleus and median eminence (Irx3 lacZ 
knock-in). Left panel shows a ventral view of a whole-mount stained brain. A 
dashed red line indicates the position of cross-section, displayed in the right panel. 
β-galactosidase-stained area in arcuate nucleus is marked by a dashed black 
line. 3V, third ventricle; HP, hypothalamus; MEm, median eminence; OC, optic 
chiasm. Data are expressed as mean ± s.e.m. *P < 0.05 compared to WT or 
control; +P < 0.05 compared to each corresponding control group. See 
additional statistical detail in Methods. 

Indirect calorimetric analysis showed higher energy expenditure in 
Irx3-knockout mice (Fig. 3k and Extended Data Fig. 8a, b, e). Importantly, 
Irx3-knockout mice show upregulation of brown adipocyte markers 
in PWAT, including Ucp1, Ppargc1a, Prdm16 and Cidea, as well as 
increased expression of Adrb3 encoding the β3-adrenergic receptor, 
suggestive of elevated sympathetic activation (Fig. 3l). Furthermore, we 
found a significant increase of Ucp1 expression in the brown adipose 
tissue (BAT) of Irx3-knockout mice (Extended Data Fig. 8e, f). WAT 
predominantly stores excessive calories and its accumulation causes 
obesity, whereas brown adipose tissue dissipates energy (lipids and glu- 
cose) and has been the focus for potential development of novel therapeutic 
strategies to treat obesity and diabetes. Our data suggest that ‘browning’ 
of WAT by higher sympathetic tone and activation of BAT might lead 
to increased energy expenditure in Irx3-knockout mice and partially 
account for their leanness and protection from diet-induced obesity. 
Browning of WAT by increased sympathetic activity is a phenom- 
enon controlled by hypothalamic circuits integrating central and peri- 
pheral metabolism regulation. Our human eQTL data suggest that the 
obesity-associated SNPs are associated with IRX3 expression in brain. 
We determined that Irx3 is expressed in the arcuate nucleus and median 
eminence of the hypothalamus, two critical regions involved in the 
regulation of energy homeostasis (Fig. 3m). To investigate a possible 
role of Irx3 expression in the hypothalamus in mediating the body 
composition and energy homeostasis phenotypes in Irx3-knockout 
mice, we used Rosa26 conditional transgenic mice expressing 
a dominant negative form of Irx3 (EnR-Irx3) by crossing with Ins2-Cre 
(Extended Data Fig. 9a, b). This design allows for disrupting Irx3 
function while preserving the architecture of genomic interactions between 
the Irx3 promoter and long-range regulatory sequences including 
the Fto obesity-associated region. Interestingly, EnR-Irx3;Ins2-Cre 
mice phenocopy the physiological, histological, and molecular metabolism 
phenotypes of germline Irx3-knockout mice (Fig. 4a-d and Extended 
Data Fig. 9c, d). Energy expenditure was significantly higher in 
EnR-Irx3;Ins2-Cre mice with similar food intake and locomotor activity 
(Fig. 4e and Extended Data Fig. 10a-e). Notably, these mice also 
showed ‘browning’ of WAT as well as activation of BAT (Fig. 4f and 
Extended Data Fig. 10f, g). Together, our data support the notion that 
the hypothalamic expression of Irx3 regulates energy homeostasis and 
body composition. 

Figure 4 | Hypothalamus-specific dominant-negative Irx3 mice 
recapitulate the metabolic phenotype of Irx3-deficient mice. a, Body weight 
of control (EnR-Irx3) and mutant mice (EnR-Irx3;Ins2-Cre) fed a normal diet. 
b, Fat and lean mass ratio as a percentage of body weight. c, Haematoxylin 
and eosin (H&E)-stained sections of IWAT, PWAT and BAT. d, Glucose 
tolerance test. e, Energy expenditure corrected for lean mass. f, Quantitative 
PCR in PWAT in control and mutant mice. Data are expressed as 
mean ± s.e.m. *P < 0.05 compared to control group. Additional details are 
given in Methods. 

Our results do not dispute the idea emerging from animal models 
that Fto is also a regulator of body mass and composition. The functional 
link between FTO and body mass regulation originated from animal 
models, unequivocally demonstrating that variation in FTO expression 
results in variation in body mass phenotypes. But to assess the null 
expectation that genetic manipulations in mice may result in variation 
in these parameters, we searched among 7,556 targeted mouse gene 
knockout models for evidence of alterations in body size, mass and 
growth (see Methods). Nearly one-third (2,166; 29%) of gene knock- 
outs in mice show alteration of these phenotypes, underscoring that 
animal models alone are not sufficient to definitively establish a func- 
tional relationship between a given gene and long-range noncoding 
variants (Supplementary Table 4). It is the aggregate of our data on 
chromatin looping, patterns of in vivo enhancer function, human eQTL 
mapping and mouse model data that collectively place IRX3 as an 
important target gene of the genetic association with human obesity. 
These observations have direct implications for how to interpret the 
associations characterized for thousands of noncoding variants with 
human disease and related quantitative traits. We observe more than 150 
genomic regulatory blocks, similar to that containing FTO and IRX3, 
overlapping with noncoding SNPs associated with human disease and 
complex traits in GWAS (Methods and Supplementary Table 5), high- 
lighting the need for a careful experimental pipeline to define the target 
gene(s) for each disease-associated noncoding SNP. 

Our data indicate that the obesity-associated SNPs within FTO are functionally 
connected with regulation of IRX3 expression, and that IRX3 is 
an important determinant of body mass and composition. IRX3 encodes 
a transcription factor highly expressed in brain. Although our data rep- 
resent the first demonstration of the intersection of IRX3 biology with 
body mass composition and metabolism, previous work identified IRX3 
overexpression in adipocytes as a hallmark of the molecular switch seen 
in patients after profound weight loss following bariatric surgery. 
These data indicate that IRX3 may have important roles regulating 
metabolism beyond the ones we describe associated with Irx3 express- 
ion in hypothalamus. Future investigations will determine the precise 
molecular mechanisms by which IRX3 regulates metabolic parameters. 


METHODS SUMMARY 


For 4C-seq, chromatin was digested with DpnII and Csp6I. Captured DNA was 
amplified using promoter-specific primers and deep sequenced. 

For 3C, nuclei were digested with HindIII. Primer quality was assessed using serial 
dilutions of BACs encompassing the regions of interest (RP23-268O10, RP23-96F3). 
The average of four independent experiments is represented graphically (Extended 
Data Fig. 2c). 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 

METHODS 


3C assays. 3C assays were performed as described in ref. 27. Mouse brain tissue 
was processed to obtain single-cell samples. Ten million isolated cells were treated 
with lysis buffer (10 mM Tris-HCl, pH 8, 10 mM NaCl, 0.3% IGEPAL CA-630 (Sigma-
Aldrich, 18896), 1× protease inhibitor cocktail (cOmplete, Roche, 11697498001)) 
and nuclei were digested with HindIII endonuclease (Roche, 10798983001). 
Subsequently, DNA was ligated with T4 DNA ligase (Promega, M1804). A set of 
locus-specific primers (listed under ‘Primers used for 3C experiments’) was designed 
with the online program primer3 v. 0.4.0 (ref. 28), each one close to a HindIII 
restriction site. These primers were used to perform real-time quantitative PCR 
using a BioRad CFX96 Real Time System with SsoFast EvaGreen Supermix 
(BioRad 172-5203), in order to measure the relative enrichment of each ligation 
product. The primer designed within the linkage disequilibrium block was used 
as the fixed primer. The quality of all primer pairs was measured using serial 
dilutions of bacterial artificial chromosome (BAC) DNA encompassing the 
regions of interest (RP23-268O10, RP23-96F3). PCR 
values were normalized by means of control primers designed in the Ercc3 gene 
locus. The average of four independent experiments is represented graphically 
(Extended Data Fig. 2c). 

4C-seq. 4C-seq assays were performed as previously reported. Whole embryos 
(9.5 days of development) or whole adult (8 weeks) mouse brains (both strain CD-1; 
Charles River) were processed to obtain approximately 10 million isolated cells, which 
were treated with lysis buffer (10 mM Tris-HCl pH 8, 10 mM NaCl, 0.3% IGEPAL 
CA-630 (Sigma-Aldrich, 18896), 1× protease inhibitor cocktail (cOmplete, Roche, 
11697498001)). Nuclei were digested with DpnII endonuclease (New England 
Biolabs, R0543M) and ligated with T4 DNA ligase (Promega, M1804). Subsequently, 
Csp6I endonuclease (Fermentas, Thermo Scientific, FD0214) was used in a second 
round of digestion, and the DNA was ligated again. Specific primers were designed 
near each gene promoter with primer3. Viewpoint fragend coordinates (for mouse 
genome build mm9) were as follows: Fto, chromosome 8, 93,837,310-93,837,724; 
Irx3, chromosome 8, 94,325,277-94,326,009. Illumina adaptors were included 
in the primer sequences. Sixteen PCRs were performed with Expand Long 
Template PCR System (Roche, 11759060001) for each viewpoint, pooled together 
and purified using High Pure PCR Product Purification Kit (Roche, 11732668001). 
Quant-iT PicoGreen dsDNA Assay Kit (Invitrogen, P11496) was used to 
measure sample concentration, and the samples were then deep sequenced. 4C-seq 
data were analysed, with some changes, as described in ref. 30. 

In brief, raw sequencing data were de-multiplexed and aligned using either the 
Mouse July 2007 (NCBI37/mm9) or Zebrafish December 2008 (Zv8/danRer6) 
assembly as the reference genome. Reads located in fragments flanked by two restric- 
tion sites of the same enzyme, or in fragments smaller than 40 bp were filtered out. 
Mapped reads were then converted to reads-per-1kb-bin units, and smoothed 
using a running mean 5-fragment window algorithm. To calculate the statistically 
significant targets for each viewpoint, a background theoretical model was 
calculated as the exponential fit curve of the average signal of 61 different 4C-seq 
samples. The P value for each bin was calculated by means of the Poisson 
probability function. Smoothed data were uploaded to the UCSC Browser for 
visualization and plotted using the browser and Circos. For plotting, statistically 
significant targets were defined as P < 0.01. 
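
The smoothing and significance steps described above can be sketched as follows. This is a minimal illustration and not the authors' pipeline: the function names are hypothetical, the running mean is assumed to be centred with truncated windows at the edges, the P value is assumed to be the upper tail of a Poisson distribution whose mean is the background expectation for the bin, and the background expectations are passed in rather than fitted from data.

```python
import math

def running_mean(values, window=5):
    """Centred running mean; windows are truncated at the two ends (assumption)."""
    half = window // 2
    smoothed = []
    for i in range(len(values)):
        lo = max(0, i - half)
        hi = min(len(values), i + half + 1)
        smoothed.append(sum(values[lo:hi]) / (hi - lo))
    return smoothed

def poisson_upper_tail(k, lam):
    """P(X >= k) for X ~ Poisson(lam), computed as 1 - P(X <= k - 1)."""
    if k <= 0:
        return 1.0
    term = math.exp(-lam)  # P(X = 0)
    cdf = term
    for i in range(1, k):
        term *= lam / i    # P(X = i) from P(X = i - 1)
        cdf += term
    return max(0.0, 1.0 - cdf)

def significant_bins(counts, background, alpha=0.01):
    """Indices of bins whose integer read count exceeds background at P < alpha."""
    return [
        i
        for i, (k, lam) in enumerate(zip(counts, background))
        if poisson_upper_tail(int(k), lam) < alpha
    ]
```

With a hypothetical flat background of 2 reads per bin, `significant_bins([0, 3, 20], [2.0, 2.0, 2.0])` flags only the last bin at the P < 0.01 threshold used for plotting.
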

Primers used for 3C experiments. The following primers were used for 3C 
experiments: 
3c_mouse_LDBlock: 5'-TGGTCTCGGGTATCTTGTCC-3' 
3c_mouse_Irx3_control1: 5'-TCGCTCCTAGTGAGACTTTC-3' 
3c_mouse_Irx3_promoter: 5'-TGATGTTGGTTCCTTACTAGG-3' 
3c_mouse_Irx3_control2: 5'-TCCACAGAAACATCTGACG-3' 
3c_mouse_Ercc3-1: 5'-TGACCTCCACACTCCTGAC-3' 
3c_mouse_Ercc3-2: 5'-ATGCGCAATTAGAAACTGC-3' 

Primers used for 4C-seq experiments. The following primers were used for 4C-seq 
experiments: 
4c_mouse_Fto_read: 5'-AATGATACGGCGACCACCGAACACTCTTTCCCTACACGACGCTCTTCCGATCTCATATTGCTCTGGATGCAGATC-3' 
4c_mouse_Fto_noread: 5'-CAAGCAGAAGACGGCATACGAGAACTACCCTTCCCAGAATGC-3' 
4c_mouse_Irx3_read: 5'-AATGATACGGCGACCACCGAACACTCTTTCCCTACACGACGCTCTTCCGATCTCCGCCGCGGAGCAGATC-3' 
4c_mouse_Irx3_noread: 5'-CAAGCAGAAGACGGCATACGATGGAGTCGCCAATCACC-3' 
4c_zebrafish_Fto_read: 5'-AATGATACGGCGACCACCGAACACTCTTTCCCTACACGACGCTCTTCCGATCTCCTCCACTGTCATCCGATC-3' 
4c_zebrafish_Fto_noread: 5'-CAAGCAGAAGACGGCATACGACTACATTTTATGTCAGTTTCGGG-3' 
4c_zebrafish_Irx3a_read: 5'-AATGATACGGCGACCACCGAACACTCTTTCCCTACACGACGCTCTTCCGATCTCCTACCGGATTACTCTACAGATC-3' 
4c_zebrafish_Irx3a_noread: 5'-CAAGCAGAAGACGGCATACGAAAAACGCCCAGAAGACTGAC-3' 

ENCODE ChIA-PET. Publicly available data for the breast cancer MCF7 cell line, 
generated as part of the ENCODE project, were downloaded from and visualized with 
the WashU EpiGenome Browser (http://epigenomegateway.wustl.edu/browser/). 
ChIA-PET was performed to capture long-range chromatin interactions associated 
with RNA polymerase II. 


Locus conservation. To assess the level of conservation of the FTO-IRX5 
locus, we used phastCons elements and scores as a measurement of conservation. 

High-scoring phastCons elements (LOD > 680, top 1%) cover 3% (38,040 out 
of 1,244,409 bp) of the genomic space that includes FTO and IRX5 (hg19, 
chromosome 16: 53,731,249-54,975,288), whereas on average only 0.5 ± 0.6% of a 
genomic interval of the same length is occupied by such high-scoring phastCons 
elements. Genomic intervals of the same size as the FTO-IRX3 locus were generated 
every 10 kb for each chromosome. The FTO-IRX5 locus is in the top 1% of intervals 
containing high-scoring phastCons elements. 

Similarly, in the FTO-IRX5 interval, 9% of the base pairs have a phastCons score 
above 0.9 (range: 0.0-1.0), whereas 14,164 intervals from all chromosomes of the 
same size offset by 200 kb contain only 3.8 ± 1.9% of such high-scoring base pairs. 
At 9%, the FTO-IRX5 locus is in the top 2% of intervals with the most conserved base pairs. 

These results indicate that the FTO-IRX5 interval contains significantly more 
highly conserved base pairs than most genomic loci. 
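
The ranking described in this section (tile each chromosome with windows the size of the locus, measure how much of each window is covered by high-scoring elements, and ask where the locus falls) could be sketched as below. This is an illustrative outline under stated assumptions, not the authors' code: all function names are hypothetical, coordinates are treated as 1-based and inclusive, and the conserved elements are assumed to be non-overlapping.

```python
def sliding_windows(chrom_length, window_size, step):
    """Yield (start, end) windows of fixed size, 1-based inclusive, every `step` bp."""
    start = 1
    while start + window_size - 1 <= chrom_length:
        yield start, start + window_size - 1
        start += step

def covered_bases(window, elements):
    """Number of bases in `window` covered by non-overlapping (start, end) elements."""
    w_start, w_end = window
    total = 0
    for e_start, e_end in elements:
        overlap = min(w_end, e_end) - max(w_start, e_start) + 1
        if overlap > 0:
            total += overlap
    return total

def top_fraction(locus_value, background_values):
    """Fraction of background windows with a value at least as high as the locus."""
    at_least = sum(1 for value in background_values if value >= locus_value)
    return at_least / len(background_values)
```

In this sketch, a locus would be reported as being in the top 1% when `top_fraction` returns a value of 0.01 or less; the step would be 10 kb for the element-coverage comparison and 200 kb for the base-pair comparison, as stated in the text.
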

Mouse in vivo transgenic reporter assays. Enhancer reporter assays and constructs, 
including the modified BAC, were created as reported previously. The human 
(NCBI37/hg19) genomic coordinates of the elements cloned are: FTO-1, chromosome 
16, 53,800,270-53,805,184; FTO-2, chromosome 16, 53,808,432-53,813,121; FTO-3, 
chromosome 16, 53,816,312-53,821,148. FTO-promoter, chromosome 16, 53,736,860- 
53,738,096; IRX3-promoter, chromosome 16, 54,319,963-54,322,782. BAC RP11- 
261B9 was modified by inserting a LacZ-Amp cassette in place of the 36 nucleotides 
in exon 1 at chromosome 16, 53,738,097-53,738,132. This homologous recombina- 
tion was facilitated by using the following homology arms: FTO-hArm1, chromosome 
16, 53,738,047-53,738,096; FTO-hArm2, chromosome 16, 53,738,133-53,738,182. 

Mouse in vivo transgenic reporter assays were performed as reported previously. 
In situ hybridization assays were performed according to standard protocols. 
The full-length mouse Fto coding DNA sequence (CDS) as riboprobe 
template was from IMAGE clone 5708558 (ATCC). 

Gene expression in human tissues. For the eQTL analyses, we used cerebellum 
(n = 153; publicly available in GEO under accession number GSE35974) and 
adipose (n = 62; publicly available in GEO under accession number GSE40234); 
details of the samples, the quality control procedure and the normalization approach 
used were described previously. For adipose, the samples were chosen to be at the 
tails of the distribution of insulin sensitivity matched on age, gender and natural 
log of BMI. Because the adipose samples included individuals of African ancestry, 
we performed principal component analysis and tested each marker in the locus 
for the additive effect of allele dosage on the residual expression phenotype (IRX3 
and FTO) after adjusting for the principal components (n = 2). Similarly, for 
cerebellum, we performed linear regression on the residual expression phenotype 
(IRX3 and FTO) after adjusting for sex and pH to evaluate the additive effect of 
allele dosage at each marker in the locus; the non-diseased samples (obtained from 
the Stanley Medical Research Institute) included in the analysis were of European 
descent and thus the principal components were not used to generate the residual 
expression. Significance (P value) was evaluated using the t-statistic from the regression. 
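
The additive-dosage test described above can be sketched as an ordinary least-squares regression of the residual expression phenotype on allele dosage. This is a minimal sketch, not the authors' analysis script: the function name is hypothetical, the expression values are assumed to have already been adjusted for covariates (sex and pH, or principal components), and dosage is assumed to be coded as 0, 1 or 2 copies of the tested allele.

```python
import math

def additive_dosage_regression(dosage, residual_expression):
    """Regress residual expression on allele dosage (0, 1 or 2).

    Returns (slope, t_statistic, degrees_of_freedom).
    """
    n = len(dosage)
    if n != len(residual_expression) or n < 3:
        raise ValueError("need at least three paired observations")
    mean_x = sum(dosage) / n
    mean_y = sum(residual_expression) / n
    sxx = sum((x - mean_x) ** 2 for x in dosage)
    if sxx == 0:
        raise ValueError("dosage has no variance")
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(dosage, residual_expression))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    rss = sum((y - (intercept + slope * x)) ** 2
              for x, y in zip(dosage, residual_expression))
    df = n - 2
    standard_error = math.sqrt(rss / df / sxx)
    if standard_error == 0:
        # Perfect fit: the t-statistic is unbounded.
        return slope, math.copysign(math.inf, slope), df
    return slope, slope / standard_error, df
```

A two-sided P value would then be read from a t distribution with n - 2 degrees of freedom; a positive slope for the BMI risk allele corresponds to the direction of effect reported for rs9930506 and IRX3.
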

Single nucleotide polymorphisms associated with gene expression levels. We 
downloaded genome-wide association data from the Genetic Investigation of 
Anthropometric Traits (GIANT) Consortium website 
(http://www.broadinstitute.org/collaboration/giant/index.php/GIANT_consortium_data_files#BMI_.28download_GZIP.29). 
We extracted the P values for those SNPs in the locus with significant 
association (P < 0.05) with IRX3 or FTO expression in adipose or cerebellum. 
We plotted the distribution of association P values for these expression-associated 
SNPs (eSNPs). rs9930506 is highly associated with BMI (P=1.41 X 10 °3, 
n = 123541). Recombination rates are estimated from the International HapMap 
Project. 

Statistical analyses. Statistical analyses were carried out using the R statistical 
software (http://cran.r-project.org/). Regional plots were done by repurposing an R 
script retrieved from the Broad Institute 
(http://www.broadinstitute.org/diabetes/scandinavs/figures.html). 

Mice. All animal experimental protocols approved by the Animal Care Committee 
of the Toronto Centre of Phenogenomics conformed to the standards of the Canadian 
Council on Animal Care. Irx3-deficient (Irx3-knockout) mice and Ins2-Cre mice 
were described previously. Rosa26 conditional transgenic mice (EnR-Irx3) 
were generated using the pROSA26PA gene-targeting vector as described previously. 
Mice were maintained on 12-h light-dark cycles and provided with food and water 
ad libitum. For diet-induced obesity studies, 8-week-old male mice were subjected 
to a 45% high-fat diet (Research Diets) for 10 weeks. Body weight was measured 
every week from 4 to 18 weeks of age, and body composition was analysed using 
an EchoMRI device (Echo Medical Systems) at 18 weeks of age. Body mass index 
(BMI) was calculated by dividing body weight (g) by body length (mm) squared 
(BMI = body weight/body length²). 

Metabolic phenotyping experiments. Energy expenditure was evaluated through 
indirect calorimetry (Oxymax System, Columbus Instruments) over periods of 24 h. 
In brief, energy expenditure was calculated by multiplying oxygen consumption 
(VO2) by the calorific value (CV = 3.815 + 1.232 × respiratory exchange ratio) 
and corrected by lean mass. Locomotor activity and food intake were measured 
simultaneously. For glucose and insulin tolerance tests, mice were subjected 
to intraperitoneal bolus injection of glucose (1 mg per g of body weight) or insulin 
(1.5 milli-units per g) after fasting overnight (14 to 16 h) or for 6 h with water ad 
libitum, respectively. Blood glucose levels were measured at the indicated intervals. 
For histological analysis, tissues were fixed in 4% paraformaldehyde and 
embedded in paraffin. Sections of 5 µm were stained with haematoxylin and eosin 
(H&E). Irx3 expression in brain and lung was examined using Irx3 lacZ knock-in 
mice with β-galactosidase staining. 
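
The energy expenditure calculation stated above can be written out as a short sketch. The calorific value formula is taken from the text; everything else is an assumption made for illustration: the function names are hypothetical, the respiratory exchange ratio is computed as VCO2/VO2, 'corrected by lean mass' is interpreted as division by lean mass, and units are left to the caller.

```python
def calorific_value(rer):
    """Calorific value from the respiratory exchange ratio (formula from the text)."""
    return 3.815 + 1.232 * rer

def energy_expenditure(vo2, vco2, lean_mass):
    """Energy expenditure per unit lean mass.

    vo2 and vco2 must share the same units; dividing by lean mass is an
    assumed reading of 'corrected by lean mass'.
    """
    rer = vco2 / vo2  # respiratory exchange ratio
    return vo2 * calorific_value(rer) / lean_mass
```

For example, with hypothetical readings `vo2=100.0`, `vco2=80.0` and `lean_mass=20.0`, the respiratory exchange ratio is 0.8 and the calorific value is about 4.80.
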

Gene and protein expression analyses. Total RNA was extracted from fat tissues 
or hypothalamus using RNeasy Lipid Tissue Kit (Qiagen), and complementary 
DNA was synthesized from 2 µg of RNA using M-MLV reverse transcriptase 
(Invitrogen) with oligo(dT). Gene expression assay was conducted using SYBR 
Green methods on ViiA 7 (Applied Biosystems), and relative cycle threshold (CT) 
values were normalized by β-actin. Primer sequences are available upon request. 
Western blot analysis was performed as described previously. The same amount 
of protein extracted from hypothalamus or brown adipose tissues was loaded, 
which was confirmed by β-actin as a loading control. Antibodies used are as 
follows: Ucp1 (Santa Cruz Biotechnology), Irx3 (generated in-house) and β-actin 
(Calbiochem). 
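
Normalizing CT values to β-actin is commonly done with the comparative CT calculation, sketched below. The text does not state the exact formula used, so this is an assumption: the function name is hypothetical, amplification efficiency is assumed to be 100% (a doubling per cycle), and expression is reported relative to a control group such as wild type.

```python
def relative_expression(ct_target, ct_reference, ct_target_ctrl, ct_reference_ctrl):
    """Fold change of a target gene versus control by the comparative CT method.

    ct_reference values are for the normalizer (here, beta-actin).
    """
    delta_sample = ct_target - ct_reference
    delta_control = ct_target_ctrl - ct_reference_ctrl
    return 2.0 ** -(delta_sample - delta_control)
```

With hypothetical values, a target that crosses threshold two cycles earlier in the sample than in the control, at equal β-actin CT, gives a fourfold increase: `relative_expression(22.0, 18.0, 24.0, 18.0)` returns 4.0.
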

Statistics. All results are expressed as mean ± s.e.m. Statistical significance of 
differences among groups was determined by Student’s t-test or analysis of variance 
(ANOVA) with Student-Newman-Keuls post-hoc analysis using SigmaStat 
(SPSS). Differences at P < 0.05 were considered statistically significant (*P < 0.05 
versus WT or control; +P < 0.05 versus WT, HFD). The following numbers of mice 
per sample were used for comparison: body-weight measurement for Irx3 mice 
(ND, WT/knockout (KO), n = 32/25; HFD, WT/KO, n = 12/7) and EnR-Irx3 
mice (control/mutant, n = 9/12); body composition analysis for Irx3 mice (ND, 
WT/KO, n = 21/12; HFD, WT/KO, n = 8/5) and EnR-Irx3 mice (control/mutant, 
n = 9/12); glucose tolerance test for Irx3 mice (ND 8 weeks, WT/KO, n = 8/5; ND 
18 weeks, WT/KO, n = 10/6; HFD 18 weeks, WT/KO, n = 12/7) and EnR-Irx3 
mice (control/mutant, n = 5/8); insulin tolerance test for Irx3 mice (WT/KO, 
n = 12/7); indirect calorimeter analysis for Irx3 mice (ND, WT/KO, n = 7/5; 
HFD, WT/KO, n = 8/4) and EnR-Irx3 mice (control/mutant, n = 5/7); and gene 
expression analysis for Irx3 mice (WT/KO, n = 10/7) and EnR-Irx3 mice (control/
mutant, n = 5/7). 

Mouse body-mass phenotypes. To calculate the fraction of knockout mice that 
display a phenotype affecting body mass, we consulted the Mouse Genome 
Informatics (MGI) database. 

First, to determine the number of gene knockouts that result in a body mass 
phenotype, we needed to define a more precise phenotype. The Mammalian 
Phenotype Ontology (http://www.informatics.jax.org/searches/MP_form.shtml) 
specifies a controlled vocabulary by which all mice in the MGI database are 
classified. Within this framework, we believe the most applicable term is ‘abnormal 
postnatal growth/weight/body size’ (MP:0002089). 

Next, we crafted a query using the MouseMine portal (http://www.mousemine.org) 
to access the MGI database, using MP:0002089 as the search criterion in the 
‘Mammalian phenotypes > Mouse genotypes’ template 
(http://www.mousemine.org/mousemine/template.do?name=MPhenotype_MFeature&scope=all). 
We asked for all targeted alleles on chromosomes 1-19, X and Y, reasoning that many 
non-targeted (that is, random, chemical and radiation-induced) mutations have not 
been precisely mapped, making it difficult if not impossible to ensure that they are 
not counted multiple times. Also, in the context of our study, we are discussing the 
chance that a targeted knockout would give a specific phenotype, so these types of 
alleles are most relevant. Finally, we excluded alleles present only as cell lines, 
which left 2,166 unique genes. 

To determine the total number of targeted mutations present in the MGI database, 
we performed a search identical to that above but omitting the MP:0002089 
term, which left 7,556 unique genes. 
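
The proportion quoted in the main text follows directly from these two counts. The short check below uses only the numbers given above; the variable names are illustrative.

```python
genes_with_phenotype = 2166  # targeted knockouts annotated with MP:0002089
genes_total = 7556           # all targeted knockouts from the same query

fraction = genes_with_phenotype / genes_total
percent = round(100 * fraction)
print(f"{fraction:.3f} ({percent}%)")  # prints "0.287 (29%)"
```
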

Genomic regulatory blocks. Genomic regulatory blocks (GRBs) are regions under strong 
selection that contain a number of syntenic highly conserved sequences, believed 
to regulate a syntenic target gene (‘anchor gene’, the gene that is the raison d’être of 
the GRB). There are different approaches to determine GRBs, usually identifying 
syntenic genes and non-coding conserved regions that span a region and determining 
the boundaries of the region. 

GWAS identified SNPs that might affect noncoding regulatory elements that, in 
turn, regulate specific target genes. Assignment of regulatory elements to targets is 
still a challenge, the simplest method being to assign them to the nearest gene. In
the case of FTO, because the GWAS SNP occurs in an FTO intron, this gene was 
considered to be the target. 

The idea put forward here is that when regulatory elements occur within a GRB, 
which is the case for FTO, IRX3 and IRX5, it is likely that they target the anchor
gene (IRX3 or IRX5), given its importance. Supplementary Table 5 contains an 
annotation of GRBs and their anchor genes, and the GWAS SNPs within the GRB, 
as well as annotation of whether the nearest gene is the anchor gene or not. In those 
cases in which the nearest gene differs from the anchor gene, we propose that the 
anchor gene be used as the putative target and additional experimentation as shown 
in this work should be performed to determine precisely which is the target gene 
for the GWAS-affected regulatory element. 

Genomic regulatory blocks were obtained from UCNEbase
(http://ccg.vital-it.ch/UCNEbase/). Human UCNE clusters (hg19) were downloaded from
http://ccg.vital-it.ch/UCNEbase/data/download/clusters/hg19_clusters_coord.bed
and are referred to here as GRBs.

The hg19 GWAS catalogue from the UCSC Genome Browser was downloaded, and the
tagging SNPs contained in GRBs with more than one gene (which makes assigning
the SNP to a target gene ambiguous) were kept for further annotation. SNPs
occurring in exons were removed. The nearest gene was considered to be the
RefSeq gene whose transcription start site was the nearest, or the gene that
contains the SNP.
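
The annotation steps described above can be sketched in a few lines of Python. This is an illustrative sketch only, not the pipeline used in the study: the `Gene` and `Block` classes, the function names and the toy coordinates are all hypothetical, the exon filter is omitted, and we assume that a gene containing the SNP takes precedence over the gene with the nearest transcription start site (the text does not state which rule wins).

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Gene:
    name: str
    chrom: str
    start: int   # leftmost coordinate
    end: int     # rightmost coordinate
    strand: str  # "+" or "-"

    @property
    def tss(self) -> int:
        # Transcription start site: left end on "+", right end on "-".
        return self.start if self.strand == "+" else self.end


@dataclass(frozen=True)
class Block:
    chrom: str
    start: int
    end: int
    anchor: str  # name of the proposed anchor gene (hypothetical field)


def genes_in_block(block: Block, genes: Sequence[Gene]) -> list:
    # Genes overlapping the block on the same chromosome.
    return [g for g in genes
            if g.chrom == block.chrom and g.start <= block.end and g.end >= block.start]


def nearest_gene(chrom: str, pos: int, genes: Sequence[Gene]) -> Optional[Gene]:
    on_chrom = [g for g in genes if g.chrom == chrom]
    # Assumption: a gene that contains the SNP wins over the nearest TSS.
    containing = [g for g in on_chrom if g.start <= pos <= g.end]
    candidates = containing or on_chrom
    return min(candidates, key=lambda g: abs(g.tss - pos), default=None)


def annotate_snp(chrom: str, pos: int, block: Block, genes: Sequence[Gene]):
    # Keep only SNPs inside a block that holds more than one gene.
    inside = block.chrom == chrom and block.start <= pos <= block.end
    if not inside or len(genes_in_block(block, genes)) < 2:
        return None
    nearest = nearest_gene(chrom, pos, genes)
    return {"nearest": nearest.name,
            "anchor": block.anchor,
            "nearest_is_anchor": nearest.name == block.anchor}


# Toy data with invented coordinates, for illustration only.
toy_genes = [Gene("GENE_A", "chr1", 1000, 5000, "+"),
             Gene("GENE_B", "chr1", 9000, 9500, "-")]
toy_block = Block("chr1", 500, 10000, anchor="GENE_B")

intronic = annotate_snp("chr1", 3000, toy_block, toy_genes)
intergenic = annotate_snp("chr1", 8000, toy_block, toy_genes)
```

In this toy case the SNP inside `GENE_A` is assigned to `GENE_A` as nearest gene while the block's anchor is `GENE_B`, which is the kind of discrepancy the annotation is meant to flag.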
Extended Data Figure 1 | Long-range interactions in mouse and zebrafish.
4C-seq data for the Fto-Irx3 locus, visualized with the UCSC Genome Browser.
a, Data (also shown in the circular plot in Fig. 1) generated using whole mouse
embryos (E9.5), showing the frequency of interactions with the promoter of
Irx3 (blue, top) or Fto (magenta, bottom). The background signal corrects for
the strong correlation between (nonspecific) ligation events and the linear
distance along the chromosome. Poisson statistical significance (−log(P value))
of the 4C-seq interactions over the background is plotted. Significant
interactions (P < 0.01), ‘targets’, are displayed in black. b, As in a, but for
adult mouse brain (8 weeks). c, As in a, but for whole zebrafish embryos
(24 h post fertilization). In all panels, the region orthologous to the obesity
association interval in the first intron of Fto is highlighted in pink.
Extended Data Figure 2 | Long-range interactions at the FTO-IRX3 locus.
a, ENCODE data for ChIA-PET using RNA polymerase 2 (POL2) in MCF7
(human breast adenocarcinoma) cells show interactions between IRX3 and the
obesity association interval in the first intron of FTO. No interactions are
observed between the FTO promoter and the association interval. These public
data are available from, and were visualized with, the WashU EpiGenome
Browser (http://epigenomegateway.wustl.edu/browser/). b, Hi-C data
previously generated in human IMR-90 (fetal lung) cells. In the association
interval, the IRX3 signal is stronger than the background (random) signal.
However, the signal for FTO is not. c, 3C data generated with adult (8 weeks)
mouse brain. Using bait (red circle) in the association interval (red rectangle),
we observe more frequent interactions with the Irx3 promoter compared to
control regions 1 and 2, which are 29 and 42 kb away, respectively, indicative of
looping.


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


[Extended Data Figure 3 image. Recoverable panel labels: brain, skeletal muscle, pancreas; brain (whole), brain (sagittal section), pancreas.]

Extended Data Figure 3 | Gene expression in mouse tissue. a, FTO
expression in lung and brain, shown by RNA in situ hybridization for mouse
Fto mRNA, in newborn (P1) mouse. Lungs and heart (left, whole organs) were
processed simultaneously and in the same well as brain (right, sagittal section)
so that the relatively higher expression in brain can be observed. b, LacZ
staining for β-galactosidase expression driven from the human FTO promoter.
Top, the promoter-LacZ fusion is in the context of 162 kb of human genomic
sequence carried in a BAC containing the first three exons of FTO, the entire
obesity-associated interval and any enhancers present. The broad expression is
consistent with previous reports in human and mouse (see main text for
references). Bottom, the promoter-LacZ construct is isolated: only the 1,237
bp proximal to the transcriptional start site are included. Broad expression is
recapitulated, indicating the robust transcriptional competency of the human
FTO promoter. c, In contrast, the 2,820-bp proximal human IRX3 promoter
is not sufficient to drive LacZ expression, which is consistent with an
enhancer-dependent transcriptional control mechanism.
[Further panel labels: GI tract; heart and lungs; heart; lungs; kidney; IRX3 promoter-LacZ.]
Extended Data Figure 4 | IRX3 expression in human brain. a, IRX3
expression in human tissues including brain. Expression data, measured on
Affymetrix HG-U133 arrays, were obtained from the Body Atlas, Tissues
(http://www.nextbio.com). The median expression across all 128 human
tissues from 1,068 arrays is shown by the red line. b, IRX3 expression in 11
different regions of human brain. Data were retrieved from Human Brain
Transcriptome data (http://www.molecularbrain.org). Amyg: amygdala; Caud
nuc: caudate nucleus; Cere: cerebellum; Corp Call: corpus callosum; DRG:
dorsal root ganglion; Frnt Cort: frontal cortex; Hippo: hippocampus; Hypo:
hypothalamus; Pit: pituitary; Spine: spinal cord; Thal: thalamus.
Extended Data Figure 5 | Linkage disequilibrium in FTO. Linkage
disequilibrium (LD) plot of logarithm (base 10) of odds (LOD) scores from the
HapMap phase II European data set, visualized in the UCSC browser. LD
blocks are outlined in black. Obesity-associated SNPs from the National
Human Genome Research Institute (NHGRI) GWAS catalogue are shown
above, in green, demonstrating why this LD block is considered to define the
‘association interval’.
Extended Data Figure 6 | Irx3-knockout male mice are leaner with reduced
adiposity. a, Representative photograph of WT and Irx3 KO mice fed ND at 18
weeks of age. b, Representative anatomical views of WT and Irx3 KO mice fed
ND. Yellow dotted lines depict subcutaneous IWAT (left) and visceral PWAT
(right). c, Tissue weights as a percentage of body weight showed smaller fat pad
sizes in Irx3 KO mice, compared to WT mice, in both ND and HFD conditions.
(ND, WT/KO, n = 20/12; HFD, WT/KO, n = 8/5.) Data are mean ± s.e.m.
(*P < 0.05 versus WT, ND; †P < 0.05 versus WT, HFD). d, Representative
H&E sections of PWAT, IWAT and BAT from ND mice demonstrated smaller
adipocyte size in Irx3 KO mice than control mice. e, Quantitative PCR of
WT and Irx3 KO PWAT for the indicated marker genes: leptin (Lep) and
adiponectin (Adipoq) are adipogenic markers, positively and negatively
associated with adiposity, respectively; Mcp1 correlates positively with
adiposity. (*P < 0.05 versus WT value.) (WT/KO, n = 10/7).




[Extended Data Figure 7 image. Recoverable panel and axis labels: Female; WT, Irx3 KO; Age (week); CTRL, Irx3 KO; Fat (g), Lean (g), Fat/BW (%), Lean/BW (%); MGWAT (subcutaneous), POWAT (visceral); MGWAT, BAT.]

Extended Data Figure 7 | Irx3-knockout female mice are leaner with
reduced adiposity. a, Body weight (BW) changes of WT and Irx3 KO female
mice fed a normal diet (ND). (WT/KO: n = 15/14). b, BMI, calculated by
BW/BL² (BL, body length), is lower in Irx3 KO female mice. (WT/KO, n = 7/7).
c, d, Body composition analysis showed reduced fat mass and to a lesser extent
reduced lean mass in Irx3 KO female mice compared to WT mice, leading to
decreased fat mass ratio (WT/KO, n = 9/8). e, Representative H&E-stained
sections of mammary gland (MG) WAT and periovarian (PO) WAT revealed
smaller adipocyte size in Irx3 KO female mice, compared to WT. f, MGWAT
and BAT weights as a percentage of body weight (WT/KO, n = 4/5). Data are
mean ± s.e.m. (*P < 0.05 versus WT value.)
Extended Data Figure 8 | Higher energy expenditure of Irx3-knockout
mice. a, Energy expenditure over a 24-h period, corrected for lean mass (kcal
kg⁻¹ h⁻¹), for 18-week-old WT and Irx3 KO mice fed with ND and HFD (ND
WT/KO, n = 7/5; HFD WT/KO, n = 8/4). b, Locomotor activity of WT and
Irx3 KO mice. c, Average amount of food intake over a 24-h period with or
without normalization to lean mass. d, Average locomotor activity measured
over 24 h. e, f, Elevated Ucp1 gene and protein expression in BAT (WT/KO,
n = 7/6). Data are mean ± s.e.m. *P < 0.05 versus WT value.
Extended Data Figure 9 | Hypothalamic-specific Irx3 dominant-negative
mice are leaner with reduced adiposity. a, Schematic diagram of generation of
transgenic mice overexpressing dominant-negative Irx3 in the hypothalamus.
b, Immunoblot analysis showed EnR-Irx3 expression in the hypothalamus of
mutant mice without affecting endogenous Irx3 expression, compared to
control mice. c, Tissue weights as a percentage of body weight showed that fat
pad sizes are smaller in mutant mice, compared to control mice. d, Reduced
leptin expression and increased adiponectin gene expression in PWAT of
mutant mice (control/mutant, n = 5/7). Data are expressed as mean ± s.e.m.
*P < 0.05 compared to control group.
Extended Data Figure 10 | Higher energy expenditure of hypothalamic
dominant-negative Irx3 mice. a, Energy expenditure over a 24-h period,
corrected for lean mass (kcal kg⁻¹ h⁻¹), for 18-week-old mice. b, Locomotor
activity for mice in panel a. c, d, Average amount of food intake over a 24-h
period with or without normalization to lean mass. e, Average locomotor
activity measured over 24 h. f, g, Elevated gene and protein expression of Ucp1
in BAT of mutant mice (control/mutant, n = 5/7). Data are expressed as
mean ± s.e.m. *P < 0.05 compared to control group.
Endothelial Notch activity promotes angiogenesis and osteogenesis in bone


Saravana K. Ramasamy¹*, Anjali P. Kusumbe¹*, Lin Wang¹ & Ralf H. Adams¹,²


Blood vessel growth in the skeletal system and osteogenesis seem to 
be coupled, suggesting the existence of molecular crosstalk between 
endothelial and osteoblastic cells. Understanding the nature of
the mechanisms linking angiogenesis and bone formation should 
be of great relevance for improved fracture healing or prevention of 
bone mass loss. Here we show that vascular growth in bone involves 
a specialized, tissue-specific form of angiogenesis. Notch signalling 
promotes endothelial cell proliferation and vessel growth in post- 
natal long bone, which is the opposite of the well-established func- 
tion of Notch and its ligand Dll4 in the endothelium of other organs 
and tumours. Endothelial-cell-specific and inducible genetic disruption
of Notch signalling in mice not only impaired bone vessel
morphology and growth, but also led to reduced osteogenesis, short- 
ening of long bones, chondrocyte defects, loss of trabeculae and 
decreased bone mass. On the basis of a series of genetic experiments, 
we conclude that skeletal defects in these mutants involved defective 
angiocrine release of Noggin from endothelial cells, which is posi- 
tively regulated by Notch. Administration of recombinant Noggin, 
a secreted antagonist of bone morphogenetic proteins, restored bone 
growth and mineralization, chondrocyte maturation, the formation of 
trabeculae and osteoprogenitor numbers in endothelial-cell-specific 
Notch pathway mutants. These findings establish a molecular frame- 
work coupling angiogenesis, angiocrine signals and osteogenesis, 
which may prove significant for the development of future thera- 
peutic applications. 

Dye injection and corrosion cast experiments had indicated that metaphyseal
vessels in growing long bone are arranged in parallel, columnar strands with
distal anastomotic interconnections. Improved bone processing and
immunostaining (see Methods) allowed the detailed analysis of these vessels,
which were straight tubules with a diameter of 5–10 μm (Fig. 1a and Extended
Data Fig. 1). At their distal end, next to growth plate chondrocytes, columns
were linked by tubular arches, which carried blind-ended, bulb-shaped
protrusions with a calibre of 15–20 μm (Fig. 1a and Extended Data Fig. 2a–c).
Osterix+ osteoblasts and their progenitors were abundant around endothelial
columns (Fig. 1a). Arch vessels extended filopodia towards the growth plate and
were fully lumenized, as demonstrated by Dextran perfusion (Fig. 1b), expression
of lumen markers like ICAM2 or Podocalyxin, and electron microscopy (Extended
Data Fig. 2d–f). Thus, the most distal, leading endothelial structures in long
bone were not sprouts but tubular arches carrying bulb-like protrusions.

5-ethynyl-2’-deoxyuridine (EdU) labelling revealed that endothelial cell (EC)
proliferation predominantly occurred within metaphyseal columns and, less
prominently, in arches (Fig. 1c and Extended Data Fig. 2g). Endomucin (Emcn), a
sialoprotein expressed by non-arterial endothelial cells, allowed the
identification of arterioles in long bone as thin (5 μm) CD31+Emcn− vessels.
Arterioles terminated in CD31+Emcn+ vascular arches (Fig. 1d). As shown in the
accompanying manuscript, CD31hiEmcnhi (referred to here as type H) endothelial
cells, as found in vessel columns and arches, display high proliferation rates
and mediate vascular growth in bone.


Known regulators of vessel growth were highly expressed in freshly isolated
type H endothelial cells relative to the much more abundant CD31loEmcnlo
(type L) cells of the sinusoidal vasculature of the marrow cavity (Extended
Data Fig. 3a). Immunostaining confirmed high expression of Neuropilin 1,
Plexin D1, Unc5b or FLT4/VEGFR3 in vessel arches and columns (Extended Data
Fig. 3b–g). These data indicate a specialized mode of angiogenesis in bone in
which type H vessel arches and columns are the functional equivalent of
endothelial sprouts.

Notch negatively regulates angiogenesis in embryos, murine retinas or tumours.
Notch and Delta-like 4 (Dll4), which signal in a cell–cell contact-dependent
fashion between endothelial cells, limit sprouting and mitosis in the growing
vasculature. In long bone, Dll4 was highly expressed by arteries and vessel
arches. Jagged1, another Notch ligand controlling angiogenesis, was found in
vessel arches and columns, but also in perivascular mesenchymal
(osteoprogenitor) cells (Extended Data Fig. 3h, i). Inducible inactivation of
the Rbpj gene encoding RBP-Jκ, an essential mediator of Notch-induced gene
transcription, in postnatal endothelium unexpectedly decreased CD31hiEmcnhi
vessels and the number of total and proliferating endothelial cells in long
bone (Fig. 1e–i and Extended Data Fig. 4a). The column/arch pattern was
disrupted in RbpjiΔEC tibiae, and filopodia extension from mutant vessels was
disorganized (Fig. 1e, g and Extended Data Fig. 4c–e). For Notch
gain-of-function experiments, we targeted the gene for Fbxw7, which mediates
polyubiquitination and proteasomal degradation of active Notch, with an
inducible knockout strategy. Although mutants had strongly reduced retinal
angiogenesis due to Notch overactivation, column/arch structures and both total
and EdU+ endothelial cells were more abundant in Fbxw7iΔEC long bone
(Fig. 1f–i and Extended Data Fig. 4b, c). CD31hiEmcnhi vessel abundance was
increased in Fbxw7iΔEC and strongly decreased in RbpjiΔEC bone sections
(Extended Data Fig. 4f–h). As Notch can positively regulate artery formation,
Fbxw7iΔEC bones contained numerous small arterioles (Extended Data Fig. 5a).

Suppression of angiogenesis by Dll4/Notch has been linked to VEGF receptor
downregulation. In RbpjiΔEC metaphyseal vessels, VEGFR1 (also known as FLT1),
VEGFR2 (also known as KDR) and VEGFR3 (also known as FLT4) immunostaining was
reduced, whereas Fbxw7iΔEC samples contained more labelled vessels (Extended
Data Fig. 5b–d). Transcripts for all three receptors were reduced in freshly
sorted RbpjiΔEC bone endothelial cells but increased in Fbxw7iΔEC cells
(Fig. 1j and Extended Data Fig. 5e, f). In contrast, mRNA for soluble VEGFR1
(sFlt1), which antagonises VEGF-induced signalling, was strongly elevated in
RbpjiΔEC samples and decreased in Fbxw7iΔEC endothelial cells (Fig. 1k).

Inducible, endothelial-cell-specific overexpression of the Notch1 intracellular
domain (NICD) phenocopied the Fbxw7iΔEC results. CD31hiEmcnhi vessels and
small arterioles were significantly more abundant in the mutant metaphysis
(Extended Data Fig. 5g, h). Thus, although Notch inhibits vessel growth and
endothelial proliferation in other organs, the pathway promotes bone
angiogenesis. Expression of Notch target genes (Hey1, Hes5), modulators of
angiogenesis (Jag1, Ephb4, Angpt2) and cell cycle regulators (Cdkn1a, Cdkn1b,
Cdkn2a) was also different between freshly isolated lung and bone endothelial
cells (Extended Data Fig. 6a–d).

Defective angiogenesis in RbpjiΔEC mice was accompanied by shortened long bones
(Fig. 2a and Extended Data Fig. 7a) and disorganization of the metaphysis with
large, irregular lacunae (Fig. 2b). Osteopontin immunostaining showed that
mutants lacked clearly separated trabeculae and contained numerous small,
highly interconnected bony elements (Fig. 2c). Micro-computed tomography (μ-CT)
and histomorphometric analysis showed significant loss of bone mass and density
(that is, bone volume over total volume; BV/TV), number and thickness of
trabeculae, and osteoid thickness in RbpjiΔEC tibiae, without appreciable
changes in bone-resorbing osteoclasts (Fig. 2d, e and Extended Data Fig. 7b–j).
Calcein double labelling showed strongly reduced bone formation rates in
RbpjiΔEC bone (Fig. 2f, g). Endothelial-specific Notch
Figure 1 | Bone angiogenesis and regulation by Notch. a, Confocal images
showing the metaphyseal vasculature in 4-week-old tibia. Osterix+ (Osx, green)
osteoblastic cells are closely associated with CD31+ (red) endothelial cells.
Nuclei, 4’,6-diamidino-2-phenylindole (DAPI, blue). Note parallel arrangement
of vessels pointing towards growth plate (gp). Right panel shows higher
magnification of inset with bulb-shaped endothelial protrusions and filopodia
(arrows). b, Dextran perfusion of lumen in distal vessel within metaphysis of
4-week-old tibia. c, Maximum intensity projection showing proliferating
(EdU+, green) cells in metaphysis. Endothelial cells, Emcn (red); nuclei, DAPI
(blue). Note abundant EdU+ endothelial cells in columns. Inset shows single
plane of EdU+ arch endothelial cells. d, 10-week-old tibia showing connection
between CD31+Emcn− arteriole (arrows) and CD31+Emcn+ vascular arch
(arrowhead). e, Defective metaphyseal vasculature and enlarged growth plate
(gp, arrows) in RbpjiΔEC tibia. Dashed lines, cortical bone; nuclei, DAPI (blue).
f, Distal arches (per mm length of metaphysis) were reduced in RbpjiΔEC tibiae
and increased in Fbxw7iΔEC samples (n = 6 mice bones from 4 independent
litters). Error bars, ± s.e.m. P values, two-tailed unpaired t-tests. g, Confocal
images showing defective CD31+ vessels (red) in RbpjiΔEC tibia and
increased Fbxw7iΔEC vessel density. Nuclei, DAPI (blue). h, i, Flow cytometric
quantitation of percentage of total (CD45−Ter119−CD31+) endothelial cells
(h) (n = 6 mice from 4 independent litters) and EdU+ endothelial cells
(i) (n = 5 mice from 4 independent litters) in RbpjiΔEC or Fbxw7iΔEC bone
samples relative to controls. Error bars, ± s.e.m. P values, two-tailed unpaired
t-tests. j, k, qPCR analysis of VEGFR2 (Kdr) and soluble VEGFR1 (sFlt1) in
sorted RbpjiΔEC and Fbxw7iΔEC bone endothelial cells (n = 4 mice from 4
independent litters). Error bars, ± s.e.m. P values, two-tailed unpaired t-test.


activation in Fbxw7iΔEC mutants had the opposite effect (Extended Data
Fig. 7m-Yt).

Osterix+ cell numbers were significantly increased in the RbpjiΔEC metaphysis
(Fig. 2h, i). Conversely, stabilization of active Notch in Fbxw7iΔEC mice
significantly enhanced femur extension and bone growth and reduced Osterix+
cell abundance without appreciable effect on osteoclasts (Fig. 2j and Extended
Data Figs 7j, k and 8a). Bglap (bone gamma-carboxyglutamate protein) and Ibsp
(integrin-binding sialoprotein) expression were reduced in RbpjiΔEC bones
(Fig. 2k), indicating that accumulating Osterix+ cells were immature
osteoblasts. Runx2+ early osteoprogenitors, which give rise to Osterix+
osteoblasts, were strongly reduced in RbpjiΔEC samples (Fig. 2l, m) but more
abundant in Fbxw7iΔEC mutants (Fig. 2n and Extended Data Fig. 8a). Confirming
the accumulation of osteoprogenitors, primary mesenchymal cells from RbpjiΔEC
bone generated mineral nodules within 10–14 days, whereas controls required
21–28 days (Extended Data Fig. 8b and data not shown). These data suggested
that Notch-dependent, endothelial-cell-derived signals control osteoprogenitor
differentiation. Arguing against defective oxygenation as the cause of these
changes, pimonidazole and anti-HIF1 staining were not appreciably altered in
RbpjiΔEC long bone (Extended Data Fig. 8c, d).

To identify the relevant Notch ligand in bone vessels, Cdh5(PAC)-CreERT2
transgenics with endothelial-cell-specific expression of tamoxifen-inducible
Cre recombinase were combined with conditional Dll1 (ref. 13), Dll4 (ref. 14)
or Jag1 (encoding Jagged1; ref. 15) mice, respectively. Vessel growth and
patterning was not visibly altered in the Dll1iΔEC or Jag1iΔEC mutant skeleton
(Extended Data Fig. 8e). Dll4iΔEC long bones were substantially shorter and
phenocopied RbpjiΔEC defects (Extended Data Fig. 8f, g), including loss of
trabeculae, appearance of highly interconnected bony elements, accumulation of
Osterix+ cells and reduction of Runx2+ early osteoprogenitors (Fig. 3a, g and
data not shown). Highly similar Dll4iΔEC and RbpjiΔEC phenotypes argued that
RBP-Jκ is essential for Notch signalling and not primarily acting as a
transcriptional repressor.

RbpjiΔEC and Dll4iΔEC mutants displayed enlarged and irregularly shaped growth
plates (Figs 1e, 2c and 3a). While labelling of mitotic chondrocytes did not
reveal alterations in the proliferating zone, mutant maturation/hypertrophy
zones were significantly enlarged and expression of Sox9, which regulates
chondrocyte differentiation and hypertrophy, was lost (Fig. 3c–e and Extended
Data Fig. 8h, i). VEGF-A expression by mature and hypertrophic chondrocytes
was also strongly reduced
Figure 2 | Endothelial Notch signalling regulates osteogenesis. a, Decreased
length of freshly dissected RbpjiΔEC femurs compared to control littermates.
b, Haematoxylin & eosin (H&E) stained longitudinal RbpjiΔEC and control tibia
sections. c, Osteopontin (Opn) immunostaining showing defective formation
of trabeculae in P28 RbpjiΔEC tibia. d, Three-dimensional μ-CT reconstruction
of 4-week-old control and RbpjiΔEC metaphysis. e, Reduced trabecular bone
volume density measured as bone volume/total volume (BV/TV) and
trabecular bone thickness (Tb.Th) in RbpjiΔEC mice. f, g, Bone formation rate
per bone surface (BFR/BS, f) calculated by calcein double labelling (7 day time
interval) confirmed decreased bone formation in P28 Rbpj mutants. Arrows in
g mark distance between calcein-labelled layers (n = 10 mice from 6
independent litters). Error bars, ± s.e.m. P value, two-tailed unpaired t-tests.


(Fig. 3f and Extended Data Fig. 8j, k). The size of Fbxw7iΔEC growth
plates and maturation/hypertrophy zones was reduced, whereas Sox9
expression was maintained (Extended Data Fig. 9a–c, f, g). Vegfa transcripts
were elevated in Fbxw7iΔEC bone samples in sharp contrast to
the reduction in RbpjiΔEC tibiae (Extended Data Fig. 9d, e).

Mesenchymal Notch expression promotes proliferation and suppresses osteoblast
differentiation. As Dll4 expression is transcriptionally regulated by Notch and
RBP-Jκ, endothelial Dll4 might signal through Notch receptors on perivascular
osteoprogenitors. However, targeting of Rbpj in postnatal osteoblastic cells
with Tg(Col1a1-creERT2)6.1.ICS transgenic mice (Extended Data Fig. 9h–j) with
the same experimental schedule as for endothelial-cell-specific mutants did not
lead to appreciable defects in the metaphyseal vasculature or osteoprogenitor
cells (Fig. 3b and Extended Data Fig. 9k and data not shown). Reinstating Notch
signalling in endothelial cells by combining Dll4iΔEC and NICD overexpressing
mutants (NICDiOE-EC) rescued the structural alterations seen in the Dll4iΔEC
metaphysis and bone vasculature, enabled trabeculae formation and restored
growth plate size, Sox9 expression and osteoprogenitor numbers (Fig. 3g and
Extended Data Fig. 9l–p). Thus, endothelial-cell-specific and cell-autonomous
Notch activity regulates bone angiogenesis and couples it to osteogenesis.
h, Osterix (Osx) immunostaining shows strongly increased osteoprogenitor
numbers in RbpjiΔEC metaphysis. i, j, Quantitation of metaphyseal Osx+ cells in
RbpjiΔEC (i) and Fbxw7iΔEC mutants (j) relative to controls (n = 6 mice from 4
independent litters). Error bars, ± s.e.m. P values, two-tailed unpaired t-tests.
k, qPCR analysis showing reduced expression of mature osteoblast markers
(Bglap, Ibsp) in RbpjiΔEC bones (n = 6 mice from 4 independent litters).
l, Immunostaining showing decrease of Runx2+ early osteoprogenitors in
4-week-old RbpjiΔEC tibiae. m, n, Quantitation of metaphyseal Runx2+ cells in
RbpjiΔEC (m) and Fbxw7iΔEC mutants (n) relative to littermate controls (n = 6
mice from 4 independent litters). Error bars, ± s.e.m. P values, two-tailed
unpaired t-tests.


Recent studies have highlighted the importance of angiocrine factors for organ
growth and regeneration. To identify potential endothelial-cell-derived signals
acting on osteoblastic cells, endothelial cells isolated from Fbxw7iΔEC and
control bone were screened by a candidate approach. Tgfb2, Bmp4, Fgf1, Fgf8,
Wnt1, Wnt3a, Wnt10b, Dkk1 and Pgf (encoding Plgf) transcripts were increased in
Fbxw7iΔEC bone endothelium (Fig. 4a). The highest upregulation (40-fold) was
observed for Nog mRNA encoding Noggin, a secreted antagonist of bone
morphogenetic proteins (BMPs). Other secreted BMP antagonists, namely Chordin
(Chrd) and Cerberus 1 (Cer1), were not detectable in FACS-isolated bone
endothelial cells (Fig. 4a). As additional evidence for Noggin as a
Notch-controlled angiocrine regulator, Nog expression was significantly reduced
in purified RbpjiΔEC bone endothelial cells (Fig. 4b). In control tibiae, Noggin
immunostaining was detected in endothelial columns/arches and surrounding
perivascular cells. In diaphysis, Noggin was detectable at lower level and
without pronounced vessel association (Extended Data Fig. 10a, b). Consistent
with its known role in bone formation, Noggin inhibited primary mesenchymal
cell differentiation into osteoblasts in vitro (Fig. 4c). Whereas global Nog
gene knockouts were embryonic lethal, heterozygotes and osteoblast-specific
mutants developed skeletal defects reminiscent of RbpjiΔEC mice including
impaired bone
Figure 3 | Osteogenesis requires EC-autonomous Notch signalling.
a, b, Osteopontin (Opn) immunostaining showing malformed bone and
growth plate (gp, dashed lines) in 4-week-old Dll4iΔEC (a) but not in
osteoblast-specific Rbpj mutant (RbpjiΔOB) tibiae (b). c, d, Confocal images of
control and RbpjiΔEC EdU-labelled proliferating zones (PZ, c) and Sox9
immunostained maturation/hypertrophy zones (MHZ, d). Note diminished Sox9 in
RbpjiΔEC mutant. Dashed lines mark PZ and MHZ borders. SOC, secondary
ossification centre; nuclei, DAPI (blue). e, Quantitative analysis of the sizes
of RbpjiΔEC mutant growth plates and maturation/hypertrophy zones (n = 6 mice
from 4 independent litters). Error bars, ± s.e.m. P values, two-tailed unpaired
t-test. f, Strongly reduced VEGF-A immunostaining (green) in P28 RbpjiΔEC mutant
growth plate (gp) chondrocytes. g, Quantitative analysis of Osx+ (left) and
Runx2+ (right) cells in control, Dll4, NICD or Dll4 NICD tibiae, as indicated.
Controls are littermate animals without Cre expression (n = 4 mice from 4
independent litters). Error bars, ± s.e.m. P values, one-way ANOVA with
Bonferroni’s multiple comparison post-hoc test.


formation, shortened femoral length, reduced trabeculae and oversized
growth plates. Daily injection of RbpjiΔEC mice with recombinant
Noggin protein for 2 weeks before analysis at P28 restored metaphysis
organization, improved bone formation and normalized the number of
osteoprogenitors (Fig. 4d–f and Extended Data Fig. 10c–f). Exogenous
Noggin also restored the organization of the RbpjiΔEC bone vasculature,
growth plate size, and chondrocyte Sox9 and VEGF-A expression (Fig. 4g
and Extended Data Fig. 10g–i).

These findings identify Notch as an important positive regulator of 
vascular growth in bone, which is highly distinct from the Dll4/Notch-mediated
suppression of endothelial cell mitosis and sprouting in other
organs. Whereas previous work had already indicated that Osterix+
Figure 4 | Role of Notch-dependent, angiocrine Noggin expression.
a, Tgfb1, Tgfb2, Tgfb3, Bmp2, Bmp4, Nog, Chrd, Cer1, Fgf1, Fgf8, Wnt1, Wnt3a,
Wnt5a, Wnt10b, Dkk1 and Pgf mRNA expression in isolated bone endothelial
cells from Fbxw7iΔEC mice normalized to littermate controls (dashed line)
(n = 4 mice from 4 independent litters). Error bars, ± s.e.m. P values,
two-tailed unpaired t-test. b, qPCR analysis showing reduced Nog expression in
endothelial cells sorted from long bones of RbpjiΔEC mice (n = 4 mice from 4
independent litters). Error bars, ± s.e.m. P value, two-tailed unpaired t-test.
c, Noggin-mediated inhibition of osteoblastic differentiation of cultured
murine mesenchymal progenitors. Mineral nodule formation (Alizarin
staining) was suppressed, and alkaline phosphatase (ALP) retained by Noggin
after 28 days of in vitro differentiation. d, e, Quantitation of Osx+ (d) and
Runx2+ cells (e) in Noggin-treated versus saline control and RbpjiΔEC long
bones (n = 6 mice from 4 independent litters). Error bars, ± s.e.m. P values,
one-way ANOVA with Bonferroni’s multiple comparison post-hoc test.
f, Calcein double labelling in RbpjiΔEC mutant or control tibiae treated with
saline or Noggin, as indicated. Noggin strongly improved RbpjiΔEC mineral
apposition rates. g, Noggin restored RbpjiΔEC growth plate size and Sox9
expression (green) in the maturation/hypertrophy zone (marked by dashed
lines). Nuclei, DAPI (blue).


cells co-invade with growing vessels in experimental bone fractures’, 
we now establish a molecular framework coupling the activity of endo- 
thelial cells, chondrocytes and osteoprogenitors. Future work will address 
whether Notch and other pathways can be used to promote fracture 
healing or prevent pathological bone loss. 


METHODS SUMMARY 


For wild-type bone analysis, C57BL/6J male mice were used unless stated
otherwise. All endothelial-cell-specific gene targeting experiments were performed
with Cdh5(PAC)-CreERT2 transgenic mice. For Rbpj deletion in the postnatal
endothelium, mice carrying loxP-flanked Rbpj (Rbpj^lox/lox) alleles and
Cdh5(PAC)-CreERT2 transgenics were interbred. To induce Cre activity and gene
deletion, offspring were injected with 500 µg tamoxifen (Sigma, T5648)
intraperitoneally every day from P10 to P14. The resulting Rbpj^iΔEC
(CreERT2^T/+ Rbpj^lox/lox) mutants and Cre-negative littermate controls were
sacrificed at P28, and femurs and tibiae were collected for analysis. Identical
breeding and tamoxifen administration strategies were used to generate
EC-specific mutants with Fbxw7^lox/lox (ref. 10), Dll1^lox/lox (ref. 13),
Dll4^lox/lox (ref. 14), or Jag1^lox/lox (ref. 15) mice and inducible
osteoblast-specific Rbpj knockouts (Rbpj^iΔOB). The latter were generated with
Tg(Col1a1-CreERT2)6.1.ICS transgenic mice (Institut Clinique de la Souris, France).

For Notch gain-of-function experiments, Gt(ROSA)26Sor^tm1(Notch1)Dam/J mice
carrying a Cre-inducible transgene for Notch1 intracellular domain
overexpression and Cdh5(PAC)-CreERT2 transgenics were interbred. Tamoxifen
administration (see above for injection schedule) was used to generate
CreERT2-positive (NICD^iOE-EC) mutants overexpressing NICD in endothelial cells
and controls. For experiments with endothelial-cell-specific Dll4^iΔEC
NICD^iOE-EC double mutants, interbreeding of Dll4^lox/lox conditional mice with
Gt(ROSA)26Sor^tm1(Notch1)Dam/J and Cdh5(PAC)-CreERT2 transgenics generated
triple heterozygote males, which were bred to Dll4^lox/+
Gt(ROSA)26Sor^tm1(Notch1)Dam/J double heterozygous females. This produced
CreERT2-negative controls together with Dll4^lox/lox NICD^+/+ CreERT2^T/+
(Dll4^iΔEC) and Dll4^+/+ NICD^+/OE CreERT2^T/+ (NICD^iOE-EC) single mutants,
and Dll4^iΔEC/NICD^iOE-EC double mutants, which received tamoxifen and
were analysed as described above. For mutant analysis, both male and female mice
were used.

For Noggin administration experiments, mice were injected intraperitoneally 
once daily with 500 µg per kg recombinant Noggin (R&D Systems) from P15 to P27,
after completion of tamoxifen injections (P10-P14) and before analysis at P28. 

All animal experiments were performed in compliance with the relevant laws 
and institutional guidelines and were approved by local animal ethics committees. 


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS


Genetically modified mice. For wild-type bone analysis, C57BL/6J male mice
were used unless stated otherwise. All endothelial-cell-specific gene targeting
experiments were performed with Cdh5(PAC)-CreERT2 transgenic mice. For Rbpj
deletion in the postnatal endothelium, mice carrying loxP-flanked Rbpj
(Rbpj^lox/lox) alleles and Cdh5(PAC)-CreERT2 transgenics were interbred. To
induce Cre activity and gene deletion, offspring were injected with 500 µg
tamoxifen (Sigma, T5648) intraperitoneally every day from P10 to P14. The
resulting Rbpj^iΔEC (CreERT2^T/+ Rbpj^lox/lox) mutants and Cre-negative
littermate controls were sacrificed at P28, and femurs and tibiae were collected
for analysis. Identical breeding and tamoxifen administration strategies were
used to generate EC-specific mutants with Fbxw7^lox/lox (ref. 10),
Dll1^lox/lox (ref. 13), Dll4^lox/lox (ref. 14), or Jag1^lox/lox (ref. 15) mice
and inducible osteoblast-specific Rbpj knockouts (Rbpj^iΔOB). The latter were
generated with Tg(Col1a1-creERT2)6.1.ICS transgenic mice (Institut Clinique de
la Souris, France).

For Notch gain-of-function experiments, Gt(ROSA)26Sor^tm1(Notch1)Dam/J mice
carrying a Cre-inducible transgene for Notch1 intracellular domain
overexpression and Cdh5(PAC)-CreERT2 transgenics were interbred. Tamoxifen
administration (see above for injection schedule) was used to generate
CreERT2-positive (NICD^iOE-EC) mutants overexpressing NICD in endothelial cells
and controls. For experiments with endothelial-cell-specific Dll4^iΔEC
NICD^iOE-EC double mutants, interbreeding of Dll4^lox/lox conditional mice with
Gt(ROSA)26Sor^tm1(Notch1)Dam/J and Cdh5(PAC)-CreERT2 transgenics generated
triple heterozygote males, which were bred to Dll4^lox/+
Gt(ROSA)26Sor^tm1(Notch1)Dam/J double heterozygous females. This produced
CreERT2-negative controls together with Dll4^lox/lox NICD^+/+ CreERT2^T/+
(Dll4^iΔEC) and Dll4^+/+ NICD^+/OE CreERT2^T/+ (NICD^iOE-EC) single mutants,
and Dll4^iΔEC/NICD^iOE-EC double mutants, which received tamoxifen and
were analysed as described above. For mutant analysis, both male and female mice
were used.

For Noggin administration experiments, mice were injected intraperitoneally 
once daily with 500 µg per kg recombinant Noggin (R&D Systems) from P15 to
P27, after completion of tamoxifen injections (P10-P14) and before analysis at 
P28. 

All animal experiments were performed in compliance with the relevant laws 
and institutional guidelines and were approved by local animal ethics committees. 
Immunohistochemistry. Freshly dissected tibiae were collected from wild-type 
and mutant mice and their control littermates and immediately fixed in 4%
ice-cold paraformaldehyde solution for 4 h. Decalcification was carried out with
0.5 M EDTA at 4 °C with constant shaking for 24 h. Then the decalcified bones
were immersed in 20% sucrose and 2% polyvinylpyrrolidone (PVP) for 24 h. Finally,
the tissues were embedded and frozen in 8% gelatin (porcine) in the presence of 
20% sucrose and 2% PVP and cryosectioned using low-profile blades on a Leica 
CM3050 cryostat. 

For phenotypic analysis, mutant mice and littermate controls were always
processed, sectioned, stained, imaged and analysed together under the same
conditions and settings. For immunostaining, bone sections were air-dried,
permeabilized for 10 min in 0.3% Triton X-100 and blocked in 5% donkey serum at
room temperature for 30 min. Blocked sections were probed with the primary
antibodies diluted in 5% donkey serum in PBS for 2 h at room temperature or
overnight at 4 °C. After primary antibody incubation, sections were washed three
times with PBS and incubated with appropriate Alexa Fluor-coupled secondary
antibodies (1:400, Molecular Probes) for 1 h at room temperature. Nuclei were
counterstained with
DAPI. Sections were thoroughly washed with PBS before mounting them using 
FluoroMount-G (Southern Biotech). Finally, the slides were air-dried and sealed 
with nail polish. 

The following primary antibodies were used: Endomucin (sc-65495, Santa Cruz,
diluted 1:100), Pecam1 conjugated to Alexa Fluor 488 (FAB3628G, R&D Systems,
1:100), Pecam1 (553370, BD Pharmingen, 1:100), Osterix (sc-22536-R, Santa Cruz,
1:200), Runx2 (MAB2006, R&D Systems, 1:200), α-SMA-Cy3 (C6198, Sigma, 1:100),
Hif1-α (ab65979, Abcam, 1:100), Collagen type 1 (AB675P, Millipore, 1:200),
Osteopontin (AF808, R&D Systems, 1:200), Calcitonin receptor (ab11042, Abcam, 1:75),
Noggin (SC-25656, Santa Cruz, 1:200), Icam2 (553326, BD Pharmingen, 1:100),
Podocalyxin (AF1556, R&D Systems, 1:100), green fluorescent protein (GFP) (A21311,
Invitrogen, 1:200), VEGF-A (SC-152, Santa Cruz, 1:100), Sox9 (AF3075, R&D
Systems, 1:100), Dll4 (AF1389, R&D Systems, 1:100), Jag1 (SC-6011, Santa Cruz,
1:200), VEGFR1 (NB100-57643, Novus Biologicals, 1:100), VEGFR2 (555307,
BD Pharmingen, 1:100), VEGFR3 (AF743, R&D Systems, 1:100), Neuropilin 1
(ab81321, Abcam, 1:100), Neuropilin 2 (AB10522, Millipore, 1:200), Plexin D1
(SC-67145, Santa Cruz, 1:200), Unc5b (ab104871, Abcam, 1:200), Robo4 (ab10547,
Abcam, 1:100), Biotin-conjugated CD45 (553077, Becton Dickinson, 1:100) and
Biotin-conjugated Ter-119 (553672, Becton Dickinson, 1:100).

For visualizing vascular lumina in the angiogenic front, tetramethyl rhodamine- 
conjugated fixable dextran of 2,000 kDa (Invitrogen, D7139) was perfused through 
the heart of deeply anaesthetized mice. Mice were euthanized after 10 min and
bones were processed further. 

For labelling of proliferating cells, mice were intraperitoneally injected with 
300 µg EdU (Invitrogen) 3 h before euthanasia. Tibiae were immediately collected
and processed as mentioned earlier. The bone sections were immunostained for 
EdU using Click-iT chemistry following the manufacturer’s protocols (Invitrogen).

For metabolic labelling with the hypoxia probe pimonidazole (Pimo,
Hypoxyprobe Inc.), mutant and control mice were intraperitoneally injected with
60 mg per kg Pimo 2 h before euthanasia. Metabolized Pimo was detected by a rabbit
antiserum against the non-oxidized, protein-conjugated form of pimonidazole
(Hypoxyprobe Inc.).
Image acquisition, processing and quantitative analysis. Immunofluorescent 
stainings were analysed at high resolution with a Zeiss LSM-780 laser scanning 
confocal microscope. Z-stacks of images were processed and 3D reconstructed 
with the Imaris software (version 7.00, Bitplane). Imaris, ImageJ, Photoshop and
Illustrator (Adobe) software was used for image processing, in compliance with
Nature’s guide for digital images.

For all cell number quantifications in the metaphysis, a region of 300-400 µm
from the growth plate towards the caudal region was selected. All
quantifications were done with Imaris and ImageJ software on high-resolution
confocal images.

Mesenchymal cell culture and differentiation. Primary mesenchymal cells were
isolated from mouse bone and cultured following published protocols. In brief,
freshly isolated long bones were crushed and then digested with collagenase at
37 °C for 1 h. Cells and small bone chips were washed thoroughly and cultured in
alpha-MEM containing 2% FBS for a few days until adherent mesenchymal cells had
migrated out from the bone fragments. The adherent cells were passaged further
and maintained at 37 °C with 5% CO2 in a humidified atmosphere. For osteogenic
differentiation, StemXVivo osteogenic basal medium and its supplement (R&D
Systems, CCM009) were used. Cells were treated continuously with recombinant
mouse Noggin (500 ng per ml, R&D Systems) and, after 28 days, the appearance
of mineral nodules and mesenchymal progenitors was analysed using Alizarin red
and alkaline phosphatase staining, respectively.

Quantitative RT-PCR (qRT-PCR). For the analysis of mRNA expression levels,
pure endothelial cell fractions were flow-sorted directly into the lysis buffer
of the RNeasy Mini Kit (Qiagen). Total RNA was isolated according to the
manufacturer’s protocol. A total of 100-500 ng RNA per reaction was used to
generate cDNA with the iScript cDNA Synthesis System (Bio-Rad). Quantitative PCR
(qPCR) was performed using TaqMan gene expression assays on an ABI PRISM 7900HT
Sequence Detection System. The FAM-conjugated TaqMan probes Kdr, Flt4, membrane
isoform Flt1, customised sFlt1, Nrp1, Plxnd1, Unc5b, Robo4, Ntn1, Ntn3, Dll4,
Jag1, Efnb2, Ephb4, Hes1, Hey1, Hey2, Hes5, Icam1, Angpt2, Cdh5, Cldn5, Pecam1,
Cdkn2a, Cdkn1a, Cdkn1b, Ccnd1, Cdk2, Cdk4, Tgfb1, Tgfb2, Tgfb3, Bmp2, Bmp4,
Nog, Chrd, Cer1, Fgf1, Fgf8, Wnt1, Wnt3a, Wnt5a, Wnt10b, Dkk1 and Pgf were
used along with TaqMan Gene Expression Master Mix (Applied Biosystems) to
perform qPCR. Gene expression assays were normalized to the endogenous
VIC-conjugated Actb probes as control. For the analysis of Bglap, Ibsp and Vegfa
expression, RNA was isolated from the whole bone samples of mutants and
littermate control animals. Freshly dissected femurs were immediately flushed (to
remove haematopoietic cells), crushed and then lysed in lysis buffer. RNA
isolation, cDNA preparation and qPCR were performed as described above.
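
The normalization to Actb and to littermate controls described above can be illustrated with the commonly used comparative Ct (2^-ΔΔCt) calculation. The text does not state which formula was applied, so this is only a sketch under that assumption; the function name and the Ct values are hypothetical.

```python
def relative_expression(ct_target_mut, ct_ref_mut, ct_target_ctrl, ct_ref_ctrl):
    """Fold change of a target gene in mutant versus control (2^-ddCt).

    Assumes roughly 100% amplification efficiency for both assays,
    which the source text does not confirm.
    """
    d_ct_mut = ct_target_mut - ct_ref_mut      # mutant, normalized to the reference gene
    d_ct_ctrl = ct_target_ctrl - ct_ref_ctrl   # control, normalized to the reference gene
    dd_ct = d_ct_mut - d_ct_ctrl               # mutant relative to control
    return 2.0 ** (-dd_ct)


# Hypothetical Ct values: the target needs one extra cycle in the mutant,
# while the reference gene (for example Actb) is unchanged.
fold = relative_expression(26.0, 18.0, 25.0, 18.0)
print(fold)  # 0.5
```

A value of 1.0 corresponds to the dashed control line used in the expression plots; values below 1.0 indicate reduced expression in the mutant.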

Flow cytometry. For flow cytometric analysis of endothelial cells, femurs and
tibiae were collected, cleaned thoroughly to remove the adherent muscles and then
crushed in ice-cold PBS with a mortar and pestle. Whole bone marrow was digested
with collagenase at 37 °C for 20 min. Cells were counted and equal numbers of cells
were subjected to immunostaining. Cells were stained with biotin-coupled CD45
(BD, 553077) and Ter119 (BD, 559971) antibodies for 45 min. After washing in PBS,
cells were stained with Streptavidin PE-Cy5 (BD, 554062) and CD31 antibody
conjugated to Alexa Fluor 488 (R&D Systems, FAB3628G) for 45 min. After washing,
data were acquired on FACS Aria II or FACS Canto flow cytometers and analysed using
FACSDiva software (Version 6.0, BD Bioscience). For analysis and sorting of
CD31^hi Emcn^hi endothelial cells, standard quadrant gates were set first.
Subsequently, to differentiate CD31^hi Emcn^hi cells from the total double
positive cells in quadrant 2, gates were arbitrarily set at >10^4 log Fl-4
(CD31-APC) fluorescence and >10^4 log Fl-2 (Endomucin-PE) fluorescence.
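
The two-threshold gate described above can be sketched as a simple filter over per-event fluorescence values. This is an illustration only, not the FACSDiva workflow: the event records, the field names and the default cut-offs are hypothetical placeholders.

```python
def gate_double_high(events, cd31_min=1e4, emcn_min=1e4):
    """Keep events whose CD31 and Emcn intensities both exceed the cut-offs.

    The default cut-offs are illustrative assumptions; real gates are set
    per experiment on compensated data.
    """
    return [e for e in events if e["cd31"] > cd31_min and e["emcn"] > emcn_min]


# Hypothetical events (arbitrary fluorescence units)
events = [
    {"cd31": 5.0e4, "emcn": 3.0e4},  # high for both markers
    {"cd31": 2.0e3, "emcn": 6.0e4},  # Emcn high only
    {"cd31": 8.0e3, "emcn": 9.0e2},  # low for both markers
]
high = gate_double_high(events)
print(len(high))  # 1
```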

Micro-CT analysis, histomorphometry and electron microscopy. Tibiae were
collected from mutants and their littermate controls; the attached soft tissue
was removed thoroughly and the bones were fixed in 2% paraformaldehyde. The fixed
tibiae were analysed using micro-CT (µCT 35) and software IPL V5.15 at Scanco
Medical AG, Switzerland. A voxel size of 12 µm was chosen in all three spatial
dimensions. For each sample, 148 of 232 slices were evaluated, covering a total
of 1.776 mm, at a voltage of 70 kVp, intensity 114 µA, and integration time 1200 ms.
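
As a quick consistency check on the scan geometry above, the evaluated height follows from the slice count and the voxel edge length. The helper name is hypothetical, and the sketch assumes one slice per isotropic 12 µm voxel.

```python
def scanned_height_mm(n_slices, voxel_um):
    """Height of the evaluated stack: slice count times voxel edge length."""
    return n_slices * voxel_um / 1000.0  # convert micrometres to millimetres


print(scanned_height_mm(148, 12))  # 1.776
```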
Calcein double labelling was performed to calculate bone formation rate (BFR)
and mineral apposition rate (MAR). Mice were given intraperitoneal injections of
10 mg per kg calcein (Sigma, C0875) dissolved in 2% sodium bicarbonate solution
on the tenth and third days before euthanasia. Bones were fixed in 4% PFA,
embedded in 8% gelatin and 2% PVP and cryosectioned. Single plane images were
acquired from the sections using LSM 780 (Carl Zeiss). Sections were stained by
the von Kossa method to assess mineralized bone. Representative images show
cortical bone (diaphysis about 3 mm proximally from the growth plate). Mineral
apposition rates were calculated from both cortical and trabecular bones. The
latter were measured at the proximal part of trabeculae relative to the angiogenic
front in the metaphysis. Osteoclast surface/bone surface (Oc. S/BS; %) and
osteoclast number/bone perimeter (No. Oc./B. Pm) were calculated based on
Calcitonin receptor staining of bone sections.
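
The standard histomorphometric definitions behind the calcein measurements can be sketched as follows. The 7-day interval follows from the injections on the tenth and third days before euthanasia; the function names, the inter-label distance and the MS/BS value are hypothetical, and the text does not specify the exact formulas used.

```python
def mineral_apposition_rate(inter_label_um, interval_days=7):
    """MAR in µm per day: mean distance between the two calcein labels
    divided by the time between the two injections."""
    return inter_label_um / interval_days


def bone_formation_rate(mar_um_per_day, ms_bs):
    """Surface-referent BFR: MAR times mineralizing surface per bone surface
    (MS/BS, expressed as a fraction between 0 and 1)."""
    return mar_um_per_day * ms_bs


# Hypothetical measurement: labels 14 µm apart, half of the surface mineralizing
mar = mineral_apposition_rate(14.0)
print(mar)                            # 2.0
print(bone_formation_rate(mar, 0.5))  # 1.0
```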

For electron microscopy, freshly collected femurs were processed similarly to
the protocol published earlier. In brief, fresh bone was directly transferred
into 4% paraformaldehyde, 0.5% glutaraldehyde in 0.1 M cacodylate buffer (pH 7.2),
cut and fixed initially for 2 h at room temperature and overnight at 4 °C. Bone
was decalcified in 5% EDTA in 0.1 M cacodylate buffer (pH 7.2) for 3 days. The
sample was post-fixed for 1.5 h in 1% OsO4, 1.5% potassium ferrocyanide.
Dehydration and embedding in epon were done under a slight vacuum. Ultrathin
sections (70 nm) of the sample were taken in the area of interest and collected
on filmed single slot copper grids (Leica UC6 ultramicrotome, Vienna, Austria).
Sections were counterstained with uranyl acetate and lead, and were analysed at
80 kV on a FEI-Tecnai 12 electron microscope (FEI, Eindhoven, The Netherlands).
Pictures were taken with imaging plates (Ditabis, Pforzheim, Germany).

Statistics. All data are presented as mean ± s.e.m. The significance of
differences in mean values was determined using two-tailed Student’s t-test
unless otherwise mentioned. P < 0.05 was considered significant. In other cases,
non-parametric one-way ANOVA was performed along with Bonferroni’s multiple
comparison post-hoc tests to assess statistical significance with a 95%
confidence interval. All calculations were performed using GraphPad Prism
software. No randomization or blinding was used and no animals were excluded from
analysis. Sample sizes were selected on the basis of previous experiments.
Several independent experiments were performed to guarantee reproducibility of
findings.
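
For readers less familiar with the summary statistics named above, the sketch below computes a mean with its standard error and applies a plain Bonferroni adjustment to a set of P values. It uses only the Python standard library and is not a reproduction of the GraphPad Prism analysis; the function names and input values are hypothetical.

```python
import math
import statistics


def mean_sem(values):
    """Return (mean, s.e.m.), where s.e.m. = sample s.d. / sqrt(n)."""
    mean = statistics.mean(values)
    sem = statistics.stdev(values) / math.sqrt(len(values))
    return mean, sem


def bonferroni(p_values):
    """Bonferroni adjustment: multiply each P value by the number of
    comparisons and cap the result at 1.0."""
    k = len(p_values)
    return [min(1.0, p * k) for p in p_values]


# Hypothetical measurements from n = 4 animals
mean, sem = mean_sem([1.0, 2.0, 3.0, 4.0])
print(f"{mean:.2f} ± {sem:.2f}")

# Hypothetical raw P values from three pairwise comparisons
adjusted = bonferroni([0.01, 0.04, 0.5])
significant = [p < 0.05 for p in adjusted]
```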
Extended Data Figure 1 | Schematic representation of key findings.
Organization and role of growing vessels in the regulation of osteogenesis in
postnatal long bone. Endothelial columns, which are embedded between the
segments of forming trabecular bone in the metaphysis, are interconnected by
arches at their distal end. Blind-ended, lumen-containing protrusions extend
from arches towards growth plate chondrocytes, a key source of VEGF-A.
Endothelial Notch signalling promotes endothelial cell proliferation and vessel
growth in bone, which is the opposite of its role in other tissues. Notch activity
in endothelial cells is also required for endothelial Noggin expression, which
controls differentiation of perivascular osteoprogenitor cells and thereby
osteogenesis. Endothelial Notch signalling and Noggin also promote
chondrocyte maturation and hypertrophy, which affects angiogenesis through
VEGF-A expression. These signalling interactions between different cell types
couple angiogenesis and osteogenesis.
Extended Data Figure 2 | Vessel growth in the postnatal metaphysis.

a, b, Organisation of distal vessels in the metaphysis of 4-week-old tibia. 
Endothelial cells were visualized by anti-Endomucin (Emcn, red) 
immunostaining (a) or GFP expression (green) in Cdh5(PAC)-CreERT2 
Rosa26-mT/mG double transgenic mice (b) at 4 weeks of age. Note blunt 
appearance of most distal vessels (top) in proximity of growth plate 
chondrocytes. Nuclei, blue (DAPI). c, Maximum intensity projection showing 
the organization of distal, CD31-immunostained (type H) vessels in the tibial 
metaphysis at the indicated ages. Note emergence of blunt and blind-ended 
bulb-like protrusions (arrows) from arch vessels (arrowheads) at the distal end 
of endothelial columns. d, Confocal image showing expression of intercellular 
adhesion molecule 2 (ICAM2, red), a marker of lumenized vessels, in distal 
vessels of the tibial metaphysis. Nuclei, blue (DAPI). e, Podocalyxin (Podx,
green), a sialoglycoprotein marking the apical surface of endothelial cells and 
thereby the vascular lumen (arrows), is present on the most distal, CD31- 
positive (red) vessel structures in the metaphysis. Nuclei, blue (DAPI). 

f, Transmission electron microscopy confirming the lumenized nature of distal 
vessels close to growth plate chondrocytes (ch). Yellow arrowheads indicate 
thin endothelial cells lining vessels. g, Quantitation of EdU-labelled
(proliferating) endothelial cells in the metaphysis of long bone. EdU+
endothelial cells were predominantly present in columnar vessels and were
comparatively rare (=5%) in vascular arches (n = 5 mice from 3 independent
experiments). Error bars, ± s.e.m. P values, two-tailed unpaired t-test.
Extended Data Figure 3 | Marker expression in arch and bulb endothelial
cells. a, Quantitative real time PCR analysis of transcripts encoding vessel
guidance molecules in type H and type L endothelial cells isolated by FACS
from 4-week-old femur. Note high levels of transcripts for Netrin-1 (Ntn1),
Netrin-3 (Ntn3), Neuropilin 1 (Nrp1), Plexin D1 (Plxnd1), Unc5b and Robo4 in
type H relative to type L endothelium (n = 5 mice from 4 independent
experiments). Error bars, ± s.e.m. P values, two-tailed unpaired t-test.

b-f, Confocal images showing immunostaining of vessel guidance molecules 
such as Neuropilin 1 (b, Nrp1), Neuropilin 2 (c, Nrp2), Plexin D1 (d), Unc5b 
(e), Robo4 (f) in metaphyseal (type H) vessels and surrounding mesenchymal 
cells. High levels of Nrp1 were detected in type H endothelial cells (arrows), but 
comparably lower expression was also observed in vessel-associated
osteoprogenitors and growth plate chondrocytes (ch). Nrp2 is highly expressed
in perivascular osteoprogenitors (arrowheads). High expression of Plexin D1
and Unc5b, and lower levels of Robo4 were detected in type H vessel bulbs and
columns (arrows). g, Maximum intensity projection showing high expression
of VEGFR3 in endothelial bulbs and arches (white arrows) compared to
columns. VEGFR3 was absent in the arteries. h, Image showing Dll4 expression
in type H endothelial cells forming bulbs and arches in 4-week-old tibial
metaphysis. With the exception of arteries (arrows), Dll4 expression was low in
vessels of the diaphysis. i, Jag1 expression was detected in endothelial cells
(white arrows) and surrounding mesenchymal cells.
Extended Data Figure 4 | EC numbers, proliferation and type H vessels in
Notch mutants. a, b, Representative flow-cytometric graph plots showing the
quantification of total and EdU-labelled endothelial cells from Rbpj^iΔEC (a) or
Fbxw7^iΔEC (b) mutant bones or corresponding littermate controls, as indicated.
CD31+ CD45- Ter119- endothelial cells were substantially reduced in
Rbpj^iΔEC mutants (a). Conversely, endothelial cell number and proliferation
were increased after inactivation of Fbxw7 (b). c, Maximum intensity
projections of EdU-labelled (green fluorescence) tibial metaphysis showing
proliferating cells in 4-week-old Notch gain-of-function (Fbxw7^iΔEC) and
loss-of-function (Rbpj^iΔEC) mutants and corresponding littermate controls.
Endothelial cells were visualized by Emcn immunostaining (red) and nuclei by
DAPI (blue). d, Confocal images of Emcn (red) immunostained endothelial
distal columns and arches next to growth plate chondrocytes (ch) highlighting
the distribution of filopodia (arrowheads). Note that filopodia were directed
towards chondrocytes in controls whereas directionality was compromised in
Rbpj^iΔEC mutants. e, Quantitation of filopodia indicating higher variability in
Rbpj^iΔEC samples (n = 6 mice from 4 independent litters). Error bars, ± s.e.m.
P values, two-tailed unpaired t-test. f, Quantitative analysis of type H vessels
in 4-week-old Fbxw7^iΔEC mutants and littermate controls (n = 6 mice from 4
independent litters). Error bars, ± s.e.m. P values, two-tailed unpaired t-test.
g, h, Maximum intensity projections of CD31 (green) and Endomucin (red)
immunostained vessels in the metaphysis of 4-week-old Rbpj^iΔEC (g) and
Fbxw7^iΔEC mutants (h). Note loss of type H vessels and decreased CD31 (green)
staining in the Rbpj^iΔEC metaphysis, while CD31+ vessels were extended after
endothelial-cell-specific inactivation of Fbxw7. Arrows indicate arteries.
Extended Data Figure 5 | Arterial specification and VEGF receptor
expression in Notch mutants. a, Maximum intensity projections of Emcn
(red) and CD31 (green) immunostained sections of 4-week-old tibia showing
increase in CD31+ (type H) vessels in Fbxw7^iΔEC mice. Mutants also displayed
numerous small CD31+ Endomucin- arterioles (white arrows), which were
associated with αSMA+ cells (blue arrows in bottom panels). b-d, Confocal
images showing 4-week-old Rbpj^iΔEC or Fbxw7^iΔEC mutant, or corresponding
littermate control metaphysis after immunostaining for different VEGF
receptors. VEGFR2 was highest on control arches, bulbs and arteries and
staining was strongly reduced in the Rbpj^iΔEC metaphysis (b). VEGFR3
immunostaining decorated arches and bulb protrusions but was absent in the
arteries. Staining was reduced in Rbpj^iΔEC mutants but enhanced in Fbxw7^iΔEC
vessels (c). Expression of VEGFR1 was predominantly found on perivascular
mesenchymal and osteoprogenitor cells. Expression in these populations was
not appreciably altered (d). e, f, qPCR analysis of sorted endothelial cells from
Notch loss-of-function (Rbpj^iΔEC, e) and gain-of-function (Fbxw7^iΔEC, f) mice.
In endothelial cells, Notch positively regulated transcripts for the receptors
VEGFR2 (Kdr), VEGFR3 (Flt4) and membrane-anchored VEGFR1 (mFlt1). In
contrast, expression of soluble Flt1 (sFlt1), a known antagonist of VEGF
signalling, was increased in Rbpj^iΔEC endothelial cells and significantly reduced
in Fbxw7^iΔEC cells (n = 4 mice from 4 independent litters). Error bars, ± s.e.m.
P values, two-tailed unpaired t-test. g, Increased formation of CD31+ (red)
vessels in the metaphysis of 4-week-old Notch gain-of-function mice after
endothelial-cell-specific overexpression of active Notch (NICD^iOE-EC). Nuclei,
blue (DAPI). h, Confocal images showing extensive formation of CD31+
(green) Emcn- (red) arterioles (arrows) in the NICD^iOE-EC metaphysis. Nuclei,
DAPI (blue).
Extended Data Figure 6 | Gene expression in Rbpj mutant bone and lung
endothelial cells. a, qPCR analysis of freshly isolated endothelial cells from
control and Rbpj^iΔEC mutant lung or bone. Relative fold mRNA expression of
Notch target genes such as Dll4, Jag1, Efnb2, EphB4, Hes1, Hey1, Hey2 is shown.
Differentially expressed genes are marked in red (n = 4 mice from 4 independent
litters). Error bars, ± s.e.m. P values, two-tailed unpaired t-test. b, Quantitative
PCR analysis of expression of the Notch target gene Hes5 in sorted endothelial
cells from 4-week-old wild-type bone and lung (left), and from Rbpj^iΔEC mutant
and littermate control bone samples (right). Note very low expression of Hes5
in lung compared to bone endothelial cells and its reduction in the Rbpj^iΔEC
mutant bone endothelium. Data represent relative fold mRNA expression
(n = 4 mice from 4 independent litters). Error bars, ± s.e.m. P values,
two-tailed unpaired t-test. c, qPCR results for transcripts encoding intercellular
adhesion molecule 1 (Icam1), angiopoietin 2 (Angpt2), VE-cadherin (Cdh5),
and CD31 (Pecam1). Data represent relative fold mRNA expression.
Differentially expressed genes are marked in red (n = 4 mice from 4 independent
litters). Error bars, ± s.e.m. P values, two-tailed unpaired t-test. d, Relative fold
mRNA expression of cyclin-dependent kinase inhibitor 2A (Cdkn2a),
cyclin-dependent kinase inhibitor 1A (Cdkn1a), cyclin-dependent kinase inhibitor 1B
(Cdkn1b), cyclin D1 (Ccnd1), cyclin-dependent kinase 2 (Cdk2), and
cyclin-dependent kinase 4 (Cdk4) in control or Rbpj^iΔEC mutant lung and bone
endothelial cells (n = 4 mice from 4 independent litters). Error bars, ± s.e.m.
P values, two-tailed unpaired t-test.
Extended Data Figure 7 | Endothelial Notch controls osteogenesis.
a-i, Analysis of bone parameters in 4-week-old Rbpj^iΔEC mutants and control
littermates. Data on femur length are shown in panel a. Connectivity density
(number of connections per unit volume, b), trabecular bone separation (size of
space separating trabeculae, c), trabecular number (number of trabeculae per
mm length, d) were obtained by µ-CT. Mineral apposition rates (MAR, e) were
calculated using calcein double labelling. Data on osteoclast number per bone
perimeter (f), osteoclast surface per bone surface (g), osteoid percentage (h) and
osteoid thickness (i) are based on histomorphometrical characterization of
control and mutant samples (n = 6 mice from 4 independent litters). Error
bars, ± s.e.m. P values, two-tailed unpaired t-test. j, Osteoclasts were identified
by immunostaining for Calcitonin receptor (green). Maximum intensity
projection of Calcitonin receptor stained metaphysis region of tibial sections of
Notch gain-of-function (Fbxw7^iΔEC) and loss-of-function (Rbpj^iΔEC) mutants
along with their respective littermate controls. Nuclei, DAPI (blue).

k-t, Analysis of bone parameters in 4-week-old Fbxw7^iΔEC mutants and control
littermates. Femur length is shown in k. µ-CT data (l) were used for the
analysis of bone density (bone volume/total volume; BV/TV, m), connectivity
density (number of connections per unit volume, n), trabecular bone separation
(size of space separating trabeculae, o), trabecular number (number of
trabeculae per mm length, p), and trabecular thickness (q). Data on osteoclast
number per bone perimeter (r), osteoclast surface/bone surface (Osteoclast
S/BS) (s), and osteoid thickness (t) are based on histomorphometrical
characterization of control and mutant samples (n = 6 mice from 4
independent litters). Error bars, ± s.e.m. P values, two-tailed unpaired t-test.
Extended Data Figure 8 | Endothelial Notch controls mesenchymal cell
differentiation. a, Confocal images showing immunostaining for
mesenchymal lineage markers Osx (green) and Runx2 (red) in Fbxw7^iΔEC and
littermate control tibiae, as indicated. Nuclei, DAPI (blue). Note reduction of
Osx+ cells and increased Runx2+ population after targeting of Fbxw7 in
endothelial cells. b, Analysis of differentiation capacity of primary
mesenchymal cells isolated from Rbpj^iΔEC or control bone. Alizarin Red S
staining after 10 days in vitro (10 div) showed that mesenchymal cells isolated
from endothelial-cell-specific Rbpj^iΔEC mutants generated mineral nodules (red
arrows) prematurely in comparison to controls. c, d, Maximum intensity
projection of stained Rbpj^iΔEC and control tibiae. Labelling of hypoxic cells with
pimonidazole (Pimo, green) showed no appreciable differences between
control and Rbpj^iΔEC samples (c). Likewise, anti-HIF1α immunostaining
(green) was comparable in the control and Rbpj^iΔEC metaphysis (d).
Endothelial cells (Emcn antibody staining, red); nuclei, DAPI (blue).
e, Representative confocal images showing that endothelial-cell-specific gene
targeting of Jagged1 (Jag1) or Delta-like 1 (Dll1) did not lead to appreciable
alterations in Jag1^iΔEC or Dll1^iΔEC mutant metaphyseal vasculature at 4 weeks.
Dashed lines indicate vessel columns (C) and distal arches (A). Endothelial
cells, CD31 (red); nuclei, DAPI (blue). f, Smaller size of freshly isolated femurs
from 4-week-old Dll4^iΔEC mutants relative to littermate controls. g, Disrupted
organization of vessel arches (A) and columns (C) in the Dll4^iΔEC metaphyseal
vasculature. Endothelial cells, CD31 (red); nuclei, DAPI (blue). h, Quantitative
analysis of size of chondrocyte proliferating zone in littermate control and
Rbpj^iΔEC mutants (n = 6 mice from 4 independent litters). Error bars, ± s.e.m.
P values, two-tailed unpaired t-test. i, j, Quantification of Sox9 (i) and VEGF-A
(j) fluorescence intensity in immunostained sections of Rbpj^iΔEC mutants
showing decreased signals in mutant growth plates (n = 4 mice from 3
independent litters). Error bars, ± s.e.m. P values, two-tailed unpaired t-test.
k, Maximum intensity projection of stained Rbpj^iΔEC and littermate control
tibiae showing decreased VEGF-A immunosignals.

Extended Data Figure 9 | Endothelial-cell-dependent regulation of growth 
plate and bone. a-c, Quantitative analysis of Fbxw7^iΔEC mutant growth plates 
(a), proliferating zones (b) and maturation/hypertrophy zones (c) relative to 
control littermates (n = 4 mice from 3 independent litters). Error bars, ± s.e.m. 
P values, two-tailed unpaired t-test. d, e, Quantitative PCR analysis of Vegfa 
transcript levels in the bones (without marrow cells) of Notch mutants 
(Rbpj^iΔEC and Fbxw7^iΔEC) and respective littermate controls (n = 4 mice from 3 
independent litters). Error bars, ± s.e.m. P values, two-tailed unpaired t-test. 
f, g, Maximum intensity projection of stained Fbxw7^iΔEC and littermate control 
tibiae. Note slight reduction of mutant growth plate and maturation/ 
hypertrophy zone (MHZ). EdU (red) labelling (f) marks mitotic chondrocytes 
in the proliferating zone (PZ). Sox9 (green) immunosignals (g) label maturing 
and hypertrophic chondrocytes. Dashed lines mark borders of PZ and MHZ. 
Nuclei, DAPI (blue). h, Maximum intensity projection of tile-scanned tibial 
section of Col1a1-CreERT2 ROSA26-mT/mG double transgenic mice. GFP 
expression (green) indicates Cre-mediated recombination. i, Maximum 
intensity projections showing overlapping GFP (green) and Col1a 
immunostaining (red). j, Maximum intensity projections showing osteoblasts 
expressing GFP (green). GFP signals were not seen in Emcn immunostained 
endothelial cells (red). k, Normal arrangement of CD31-stained endothelial 
columns (C) and arches (A) after Col1a1-CreERT2-mediated inactivation of 
Rbpj in cells of the osteoblast lineage (Rbpj^iΔOB). Nuclei, DAPI (blue). 
l, m, Maximum intensity projections of tile-scanned tibia sections after 
anti-Emcn (red) immunostaining. Nuclei, DAPI (blue). Note profound 
disorganization of the Dll4^iΔEC metaphyseal vasculature (l) and restoration by 
simultaneous endothelial-cell-specific overexpression of active Notch in 
Dll4^iΔEC/NICD^iOE-EC double mutant mice (m). n, Tile scan confocal images of 
osteopontin (Opn, green) stained control and Dll4^iΔEC/NICD^iOE-EC tibiae 
showing that endothelial NICD expression can rescue trabecular bone defects 
in the Dll4^iΔEC background (see Fig. 2c for comparison). o, Organization of 
the Emcn-stained metaphyseal vasculature and arrangement of osteoblasts 
(Osx) in tibia section from 4-week-old control, Dll4^iΔEC, NICD^iOE-EC, and 
Dll4^iΔEC/NICD^iOE-EC mice, as indicated. Nuclei, DAPI (blue). p, Sox9 
immunostaining (green) in the epiphyseal region of 4-week-old tibia showing 
normalized maturing and hypertrophic chondrocytes zone (MHZ) in the 
growth plate of Dll4^iΔEC/NICD^iOE-EC mice. Dashed lines mark borders of MHZ. 
Nuclei, DAPI (blue). 

Extended Data Figure 10 | Notch and angiocrine Noggin production. 
a, Noggin (green) in the wild-type tibial metaphysis was detected in Emcn+ 
(red) endothelial cells as well as surrounding mesenchymal cells. In contrast, 
the diaphysis contained Noggin+ haematopoietic cells, whereas only weak 
staining was seen in sinusoidal blood vessels. b, Panels show higher 
magnifications of insets in a. Nuclei, DAPI (blue). c, Confocal tile scans of 
osteopontin-immunostained tibia sections showing partial restoration of 
trabecular bone formation in 4-week-old Rbpj^iΔEC mice after administration of 
recombinant Noggin. Left panels show saline-treated Rbpj^iΔEC mutants and 
littermate controls. Nuclei, DAPI (blue). d, e, Noggin treatment restored the 
bone formation rate (BFR, d) and mineral apposition rate (MAR, e) in Rbpj^iΔEC 
long bone to control level (n = 6 mice from 4 independent litters). Data 
represent mean ± s.e.m. One-way ANOVA was performed along with 
Bonferroni’s multiple comparison post-hoc test. f, Systemic administration of 
recombinant Noggin protein reduced the number of Osx+ cells (green) and 
increased Runx2+ early osteoprogenitors in the Rbpj^iΔEC metaphysis in 
comparison to vehicle-treated (saline) mutants. g, Maximum intensity 
projection of Emcn-immunostained control and Rbpj^iΔEC tibia sections after 
treatment with saline or recombinant Noggin, as indicated. Emcn staining 
intensity was increased in Noggin-treated Rbpj^iΔEC samples, and the 
organization of endothelial column and arch structures was partially restored. 
Dashed lines indicate position of boundaries between endothelial arches (A) 
and columns (C) as seen in littermate control samples. h, i, Confocal images of 
VEGF-A (green) immunostained tibia sections showing growth plate 
chondrocytes in Rbpj^iΔEC mice after Noggin treatment. Note partial restoration 
(arrows) of VEGF-A expression in Noggin- but not vehicle control-treated 
(Saline) Rbpj^iΔEC mutants (h). Nuclei, DAPI (blue). Quantification data 
showing fluorescence intensity (in arbitrary units) of VEGF-A expression 
recovered in the tibial sections of these animals (i) (n = 6 mice from 4 
independent litters). Data represent mean ± s.e.m. One-way ANOVA was 
performed along with Bonferroni’s multiple comparison post-hoc test. 

Two independent transcription initiation codes 
overlap on vertebrate core promoters 

Vanja Haberle, Nan Li, Yavor Hadzhiev, Charles Plessy, Christopher Previti, Chirag Nepal, Jochen Gehrig, 
Xianjun Dong, Altuna Akalin, Ana Maria Suzuki, Wilfred F. J. van IJcken, Olivier Armant, Marco Ferg, Uwe Strähle, 
Piero Carninci, Ferenc Müller & Boris Lenhard 


A core promoter is a stretch of DNA surrounding the transcription 
start site (TSS) that integrates regulatory inputs’ and recruits gen- 
eral transcription factors to initiate transcription’. The nature and 
causative relationship of the DNA sequence and chromatin signals 
that govern the selection of most TSSs by RNA polymerase II remain 
unresolved. Maternal to zygotic transition represents the most marked 
change of the transcriptome repertoire in the vertebrate life cycle**. 
Early embryonic development in zebrafish is characterized by a series 
of transcriptionally silent cell cycles regulated by inherited maternal 
gene products: zygotic genome activation commences at the tenth 
cell cycle, marking the mid-blastula transition’. This transition pro- 
vides a unique opportunity to study the rules of TSS selection and 
the hierarchy of events linking transcription initiation with key chro- 
matin modifications. We analysed TSS usage during zebrafish early 
embryonic development at high resolution using cap analysis of 
gene expression®, and determined the positions of H3K4me3-marked 
promoter-associated nucleosomes’. Here we show that the transi- 
tion from the maternal to zygotic transcriptome is characterized by 
a switch between two fundamentally different modes of defining 
transcription initiation, which drive the dynamic change of TSS 
usage and promoter shape. A maternal-specific TSS selection, which 
requires an A/T-rich (W-box) motif, is replaced with a zygotic TSS 
selection grammar characterized by broader patterns of dinucleo- 
tide enrichments, precisely aligned with the first downstream (+1) 
nucleosome. The developmental dynamics of the H3K4me3-marked 
nucleosomes reveal their DNA-sequence-associated positioning at 
promoters before zygotic transcription and subsequent transcription- 
independent adjustment to the final position downstream of the 
zygotic TSS. The two TSS-defining grammars coexist, often physically 
overlapping, in core promoters of constitutively expressed genes to 
enable their expression in the two regulatory environments. The 
dissection of overlapping core promoter determinants represents 
a framework for future studies of promoter structure and function 
across different regulatory contexts. 

Mapping of TSSs using cap analysis of gene expression (CAGE)® iden- 
tified two major promoter classes with respect to TSS precision'®"!: ‘sharp’ 
promoters with one predominant TSS, often associated with a TATA-box 
that determines TSS selection; and ‘broad’ promoters with a wider dis- 
tribution of TSSs, often overlapping with a CpG island. Even with recent 
reports of the prevalence of known core promoter elements in human 
promoters’’, the actual mechanism for choosing TSSs within vertebrate 
promoters in various cell types and conditions remains unknown. 


To address developmental-stage-specific promoter usage through- 
out early embryonic development, we analysed a nucleotide-resolution 
map of transcription initiation events in the zebrafish genome, generated 
by CAGE across 12 stages from the unfertilized egg to organogenesis’’ 
(Fig. 1a). The data revealed numerous cases of promoter dynamics in 
which maternal messenger RNAs were initiated from different posi- 
tions than zygotic transcripts, often with shifting of TSS positions within 
a single promoter (Fig. 1a). 

Clustering of individual TSSs by expression profile revealed several 
major classes of TSS dynamics (Fig. 1b): TSSs present preferentially in 
maternal (before the mid-blastula transition (MBT)) stages, reflecting 
maternally inherited transcripts, as opposed to those activated in early 
or later zygotic stages (post-MBT). Additional clusters included con- 
stitutively present TSSs and TSSs with peak activity in the transitional 
stages, confirming major changes in the zebrafish transcriptome initiated 
at the MBT*"*. An equivalent clustering of entire promoters revealed a 
similar pattern (Extended Data Fig. 1a). However, promoters with no 
change in the overall expression level often contained a population of 
TSSs with very heterogeneous relative usage during development (Fig. 1a 
and Extended Data Fig. 1b-d). 

The observed differential TSS utilization between inherited (maternal) 
and de novo transcribed (zygotic) mRNAs suggested distinct rules for 
TSS selection acting within the same promoter in the oocyte and the 
embryo. To reveal the underlying signatures guiding differential pro- 
moter interpretation by the maternal and zygotic transcription machin- 
ery, we further dissected maternal- and zygotic-specific promoter usage. 
We first identified a subset of promoters similar to the example in Fig. 1a, 
showing a significant degree of shifting between maternally and zygo- 
tically used TSSs (Extended Data Fig. 2a, b). Our set contained 911 ‘shifting’ 
promoters whose CAGE signal in pre- and post-MBT stages overlapped 
by less than 40% (Supplementary Table 1). The TSS shift happened in 
either direction, but mainly within a narrow window of up to 100 base 
pairs (bp) (Extended Data Fig. 2c). The preferred maternal and zygotic 
TSS of those promoters showed antagonistic developmental dynamics, 
with degradation of inherited maternal transcripts and gradual activa- 
tion of zygotic ones (Extended Data Fig. 2d). 

Aligning sequences of shifting promoters by their maternal domi- 
nant TSS revealed a clear enrichment of T- and/or A-containing (WW) 
dinucleotides ~30 bp upstream of the maternal TSS, hinting at the pre- 
sence of a functional TATA-like element’ (Fig. 2a and Extended Data 
Fig. 2e). By contrast, the zygotic TSS did not show a TATA-like signal 
in the expected position, but had a sharp SS-WW boundary in local 
C/G and A/T dinucleotide enrichment precisely aligned ~50 bp down- 
stream of the zygotic TSS (Fig. 2a and Extended Data Fig. 2e). This sug- 
gests that there are two fundamentally different sequence signals guiding 
transcription initiation in the oocyte and the embryo. 

Only a small fraction of maternal TSSs (<10%) had a canonical 
TATA-box motif (Fig. 2b), whereas the majority contained other A/T- 
rich pentamers (Extended Data Fig. 2f). Motif discovery revealed the 
presence of an A/T-rich motif (W-box) with lower information content 
than the canonical TATA-box, but equally positioned 30 bp upstream 
of the maternal TSS (Fig. 2c). By contrast, the zygotic TSS did not show 
the presence of a TATA-box or W-box in the expected upstream region 
(Fig. 2b). This reveals a shift from W-box motif-dependent TSS selec- 
tion in the maternal transcriptome to zygotic W-box-independent TSS 
selection; that is, the existence of two major, independent mechanisms 
for defining transcription initiation acting on the same core promoter. 

We conjectured that the uncovered rules for maternal and zygotic 
TSS selection may apply generally to all constitutively expressed genes, 
even in the absence of clear TSS shifting. Dinucleotide analysis of all 
8,369 constitutively expressed promoters showed the same precise posi- 
tioning of the W-box signal upstream of the maternal dominant TSS, and 
alignment of the zygotic TSS with the downstream SS-WW boundary, 
as seen in the shifting promoters (Fig. 3a and Extended Data Fig. 3a—c). 
This confirmed a promoterome-wide distinction between determinants 
that govern TSS selection in the oocyte and the embryo, and demon- 
strated that complex TSS patterns in constitutively expressed promoters 
represent readouts of two independent grammars intertwined in the 
same core promoter regions. 

Finally, we showed that exclusively maternal and exclusively zygotic 
promoters also utilize the corresponding stage-specific TSS selection 
signals (Extended Data Fig. 3f-h). These results confirm a global change 
in promoter interpretation that constitutes a central part of the maternal 
to zygotic transition, with fundamental differences in the TSS selection 
mechanism used by the transcription machinery in the oocyte and the 
embryo. 

Fixed spacing between the motif and the TSS imposed by W-box- 
dependent initiation in the oocyte predicts the sharp TSS configuration”®. 
The set of maternal broad promoters, which seemingly contradicted 
the imposed constraints, revealed a novel promoter architecture com- 
posed of multiple individual relatively sharp CAGE tag clusters (TCs), 
each with its associated W-box at a fixed approximately −30 bp posi- 
tion (Extended Data Fig. 4). On the other hand, the exclusively zygotic 
promoters showed a less constrained distribution of TSSs, revealing 
the familiar shape of a broad promoter”, with the majority (>70%) 

Figure 2 | Sequence signature of a large set of shifting promoters changes 
markedly during maternal to zygotic transition. a, Dinucleotide density 
(see Extended Data Fig. 10) at 911 shifting promoters sorted and aligned 
according to the distance and orientation of the TSS shift (schematics on the 
top; M, maternal stage; P1, P2, P3, individual promoters; Z, zygotic stage). 
Promoters were centred at either the maternal (left) or the zygotic (right) 
dominant TSS. Blue arrowhead indicates TA enrichment at the expected 
position of the TATA-box; red arrowhead indicates boundary between GC 
and TA enrichment ~50 bp downstream of zygotic TSS. b, Distribution of 
match (%) to TATA-box in the region −35 to −22 bp upstream of the maternal 
(blue) and zygotic (red) dominant TSS (P value, two-tailed Wilcoxon rank-sum 
test). PWM, position weight matrix. c, Motif obtained by motif discovery 
upstream of maternal dominant TSS. 

Figure 3 | Transition from maternal W-box-motif-dependent to zygotic 
nucleosome-positioning-signal-related transcription initiation is pervasive. 
a, Dinucleotide density at 8,369 constitutively expressed promoters sorted by 
the distance between the maternal and zygotic dominant TSS. Promoters were 
centred at either the maternal (left) or the zygotic (right) dominant TSS. Blue 
arrowhead indicates position of the maternal TSS-associated W-box; red 
arrowhead indicates the SS-WW boundary ~50 bp downstream of zygotic 
TSS; asterisks indicate GC enrichment in the internucleosomal region. 

b, Predicted maternal and zygotic codes in the sf3a2 promoter. Dinucleotide 
density and sequence of the wild-type (WT) and mutated (mut) sf3a2 promoter 
are shown on top. TSSs detected by CAGE in wild-type zebrafish in the 


containing only one broad TC (Extended Data Fig. 5a—c). Constitu- 
tively used promoters changed their shape accordingly, from a single 
or multiple sharp TSS configuration in the maternal stages, to a broad 
configuration in the zygotic stages (Extended Data Fig. 5d, e). Thus, 
the switch between the maternal and zygotic TSS is accompanied by a 
global change in the promoter architecture within the same region. 
To validate functionally the observed TSS selection grammars, we 
identified the W-boxes and dinucleotide frequency patterns in a con- 
stitutively active promoter (sf3a2) and mutated all W-boxes associated 
with maternal TSSs (Fig. 3b) for analysis in transgenic zebrafish. Fluo- 
rescence reporter activity and 5’ rapid amplification of cDNA ends 
(RACE) assays demonstrated that removal of all W-boxes did not influ- 
ence zygotic transgene activity or zygotic TSS selection (Extended Data 
Fig. 6), confirming W-box-independent promoter usage in the embryo. 
To validate W-box-dependent TSS selection in the oocyte, we analysed 
maternal TSS selection in early F1 generation embryos from stable 
transgenic lines with wild-type or mutated variants of the sf3a2 pro- 
moter. We developed a novel method (single-locus CAGE (SL-CAGE); 
Supplementary Table 2) for the detection and relative quantification of 
TSS usage at 1 bp resolution within the targeted promoter. TSS usage 
patterns of the wild-type sf3a2 transgene promoter in early embryos of 
several transgenic lines were highly reproducible and perfectly corre- 
lated with maternal TSS usage of the endogenous sf3a2 gene as well as 
with that seen by CAGE (Fig. 3b and Extended Data Fig. 7). The removal 
of W-boxes severely reduced the use of associated downstream posi- 
tions as TSSs in the mutant transgenic lines compared to the wild-type 
lines and led to an aberrant TSS usage pattern (Fig. 3b and Extended 
Data Fig. 7d; P < 0.01), confirming that the selection of these TSSs 
depends on a W-box signal in the oocyte. These results indicate two 
independent TSS selection mechanisms used by the oocyte and the 
embryo within a single promoter. 

To address the relationship between stage-specific TSS selection and 
chromatin configuration, we analysed the positioning of H3K4me3- 
and H2A.Z-containing nucleosomes at core promoters by chromatin 
immunoprecipitation followed by sequencing (ChIP-seq). It was previ- 
ously shown that H3K4me3 marking on promoter-associated nucleo- 
somes precedes gene transcription during zebrafish genome activation”. 
Our data revealed precise positioning of the first downstream (+1) 
nucleosome ~50 bp from the preferred zygotic TSS, but no fixed spac- 
ing to the maternal TSS for all constitutively active promoters in the 
zygotic (prim 6) stage (Fig. 3c and Extended Data Fig. 3d). We observed 
a less sharp nucleosome alignment to the zygotic TSS in earlier stages, 
including the 512-cells stage, which precedes the onset of zygotic tran- 
scription. By contrast, no alignment of H3K4me3-marked nucleosomes 
to the maternal TSS was detected in any stage (Fig. 3c). These results 
revealed a positional interdependency between the zygotic TSS and 
the +1 nucleosome in the embryo as a feature of zygotic TSS selection 
grammar, independent of the W-box motif-guided TSS selection in the 
oocyte. Promoter-associated nucleosome alignment corresponded with 
alternating WW-SS patterns downstream of the zygotic TSS (asterisks 
in Fig. 3a, c), providing an internucleosomal position signal. At a higher 
resolution, the nucleosome-occupied DNA downstream of the zygotic 
TSS showed a 10 bp periodicity in AA and TT dinucleotide enrichment 
(Fig. 3d and Extended Data Fig. 3e), previously identified as an intra- 
nucleosomal positioning signal'®. The strong association of the zygotic, 
but not the maternal, TSS with these nucleosome positioning signals 
suggests that TSS selection in a vertebrate oocyte is independent of 
inter- and intranucleosomal DNA signals. 

Recent efforts to identify sequence-based signals for nucleosome 
positioning’”'* and dynamic nucleosome organization at promoters'””° 
highlight the epigenetic and chromatin mechanisms*’” that, together 
with the DNA sequence, direct transcription initiation. The association 
of nucleosome positioning signals with zygotic promoter activity described 
here raises the question of whether promoter-associated nucleosome 
positioning contributes to regulation of positioning of transcription 
initiation, or is merely a consequence of transcription at the predefined 
position. To investigate this relationship we analysed the DNA sequence 
underlying +1 nucleosome positioning in the transcriptionally silent 
pre- (512 cells) and active post-MBT (prim 6) stages (Fig. 4a and Extended 
Data Fig. 8). In the pre-MBT stage, H3K4me3-marked nucleosomes 
occupied a CG/GC enriched region and centred at the peak of highest 
CG/GC enrichment, often directly overlapping the TSSs of the mater- 
nal transcripts, supporting the idea that H3K4me3 initially appears 
at CpG islands before transcription”. In the post-MBT stage, the +1 
nucleosome was positioned just downstream of the SS-WW enrich- 
ment boundary at ~50 bp from the zygotic TSS, occupying a WW- 
enriched region, with a small local GC/CG peak at the nucleosome 
midpoint (Fig. 4a and Extended Data Fig. 8b). Additional downstream 
nucleosomes followed a similar pattern of WW-enriched bound DNA 
alternating with internucleosomal SS enrichment. Local GC/CG enrich- 
ment at the nucleosome midpoints is in accordance with previously 
described nucleosome positioning preferences'*®; however, the addi- 
tional sequence preference complexity and its relation to TSS in dif- 
ferent developmental stages were not reported so far. The results show 
that the initial positioning of promoter-associated nucleosomes, which 
correlates with a broad internucleosomal phasing pattern, changes in 
later stages to a final precise positioning, which correlates tightly with 
the zygotic TSS and intranucleosomal phasing patterns, suggesting inter- 
dependence of the final nucleosome positioning and transcription. 




To test this, we ranked throughout-active genes by the timing of onset 
of their zygotic transcription and analysed their H3K4me3-marked nucle- 
osome positioning patterns (Fig. 4b). No association between the timing 
of transcription activation and the precision of nucleosome position- 
ing was found, arguing against transcriptionally aided nucleosome 
readjustment and instead suggesting a pre-transcriptional process for 
repositioning nucleosomes to their final position, in agreement with 
transcription-independent positioning of nucleosomes at promoters 
in human cells”. Consistently, H3K4me3 ChIP-seq in TATA-box- 
binding protein (Tbp) knockdown embryos (Extended Data Fig. 9) 
showed no change in the overall H3K4me3 recruitment and nucleo- 
some positioning at Tbp-dependent genes (Fig. 4c), demonstrating 
that H3K4me3-marked nucleosome positioning at these genes does 
not require Tbp-dependent recruitment of the transcription initiation 
machinery or active transcription. 

The absence of a nucleosome-positioning sequence signature, as well as 
of precise nucleosome positioning at promoters with a canonical TATA- 
box in other systems”, together with narrow TSS peaks, argues in favour 
of the W-box as the overriding determinant of maternal TSS selection. 
The similarity of the W-box to the TATA-box suggests that transcrip- 
tion initiation in the oocyte may be mediated by the oocyte-enriched 
transcription nucleating factor Tbpl2 (also known as Tbp2)”°”’. Con- 
versely, early zygotic grammar prefers the TSS position to be at a fixed 
range from the precisely positioned +1 nucleosome, suggesting a mech- 
anism in which the initiation complex chooses initiator-like sequences 
within a ‘catchment area’ determined by the nucleosome position (Fig. 4d). 
This model is compatible with motif-independent TFIID recruitment 
by H3K4me3-TAF3 interactions** and emphasizes the interdepen- 
dence of nucleosome configuration at promoters with promoter type 
and physiological state in vertebrates”**° and yeast’?”?. 

Different TSS selection grammars deployed at separate promoters 
have been associated with different types of genes”, and a handful 
of promoters were shown to switch between TATA-dependent and 
-independent initiation*’. We show for the first time, to our knowledge, 

Figure 4 | H3K4me3-marked nucleosome positioning reveals dynamic 
changes in underlying sequence signature and relation to TSS during 
maternal to zygotic transition. a, Frequency of dinucleotides centred on +1 
nucleosome of constitutively active promoters in maternal (512 cells) and 
zygotic (prim 6) stages. Centres of nucleosomes were estimated from subtracted 
H3K4me3 coverage (grey). Density of maternal and zygotic TSSs is shown in 
light blue and light red, respectively. b, H3K4me3 signal at promoters of 
constitutively present transcripts sorted by the time of activation of their 

that the two grammars co-exist in close proximity or in physical overlap 
genome-wide, and are differentially used at thousands of promoters active 
in both the oocyte and the embryo. The multiple layers of information 
embedded in the same short sequence, each representing a different 
aspect of a complex regulation, are part of the reason why promoter 
codes have been so difficult to detect. Our findings on overlapping 
promoter grammars have implications for future analyses of promoter 
content and function. 


METHODS SUMMARY 


TSSs defined by CAGE were clustered into TCs, and TCs into promoter regions, 
allowing a maximal distance of 20 and 100 bp, respectively. Dinucleotide density 
was calculated by applying two-dimensional Gaussian kernel smoothing on a dinu- 
cleotide occurrence matrix. TSS usage in the sf3a2 promoter was analysed in 
transgene-injected embryos by 5’ RACE and in stable transgenic zebrafish lines 
by SL-CAGE. ChIP was carried out on MNase-treated chromatin using the ChIP- 
IT Express Enzymatic kit (Active Motif), and libraries were sequenced for 36 or 
56 bp on the HiSeq 2000 system. Minus-strand reads coverage was subtracted from 
plus-strand reads coverage for visualization and nucleosome midpoint estimation. 
Tbp was knocked down by injecting one-cell-stage zebrafish embryos with the tbp 
morpholino. 
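
The strand-subtraction step used for nucleosome midpoint estimation can be illustrated with a short sketch. It assumes per-base coverage vectors for plus- and minus-strand reads are already available, and it assumes that a midpoint is taken where the subtracted signal changes from positive to non-positive (plus-strand reads accumulating at the upstream edge of a nucleosome, minus-strand reads at the downstream edge). The function names and the crossing rule are illustrative assumptions, not the authors' implementation.

```python
def subtracted_coverage(plus, minus):
    """Per-base plus-strand coverage minus minus-strand coverage."""
    return [p - m for p, m in zip(plus, minus)]


def midpoint_estimates(diff):
    """Indices where the subtracted signal goes from positive to non-positive.

    Assumption: such a crossing approximates a nucleosome midpoint.
    A positive value followed by an exact zero also counts as a crossing.
    """
    return [i for i in range(1, len(diff)) if diff[i - 1] > 0 and diff[i] <= 0]


plus = [0, 3, 5, 3, 1, 0, 0, 0]
minus = [0, 0, 0, 1, 3, 5, 3, 0]
diff = subtracted_coverage(plus, minus)
print(diff, midpoint_estimates(diff))
```

In real data the subtracted signal would normally be smoothed before crossings are called; that step is omitted here.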


Online Content Any additional Methods, Extended Data display items and Source 
Data are available in the online version of the paper; references unique to these 
sections appear only in the online paper. 
METHODS 

CAGE tags mapping and CTSS calling. Sequenced CAGE tags (27 bp) as prev- 
iously described’* were mapped to a reference zebrafish genome (Zv9/danRer7 
assembly) using Bowtie*! with default parameters allowing up to two mismatches 
and keeping only uniquely mapped reads. An additional G nucleotide, which is 
often attached to the 5’ end of the tag by the template-free activity of the reverse 
transcriptase in the cDNA preparation step of the CAGE protocol”, was removed 
in cases in which it did not map to the genome. All unique 5’ ends of tags were 
considered as CAGE tag-defined TSSs (CTSSs) and the number of tags supporting 
each CTSS was counted. Raw tag count was normalized to a referent power-law 
distribution based on a total of 10^6 tags and α = −1.25, as described previously, 
resulting in normalized tags per million (tpm). All analyses were done in the R sta- 
tistical computing environment™ (http://www.R-project.org/) using Bioconductor*’ 
(http://www.bioconductor.org/) software packages and custom scripts. 
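
As a rough illustration of the CTSS calling step, the sketch below drops a leading G that does not match the genome and then counts tags per unique 5' end. The `Tag` record and its fields are hypothetical stand-ins for what an aligner output would provide; the power-law normalization to tpm is not reproduced here.

```python
from collections import Counter
from typing import NamedTuple


class Tag(NamedTuple):
    chrom: str
    strand: str      # "+" or "-"
    five_prime: int  # genomic coordinate of the first base of the tag
    first_base: str  # first sequenced base of the tag (5'->3')
    ref_base: str    # reference base at that position, on the tag's strand


def ctss_counts(tags):
    """Count tags supporting each CAGE tag-defined TSS (CTSS)."""
    counts = Counter()
    for t in tags:
        pos = t.five_prime
        # An unencoded G added by reverse transcriptase is removed, which
        # moves the 5' end one base downstream on the tag's strand.
        if t.first_base.upper() == "G" and t.ref_base.upper() != "G":
            pos += 1 if t.strand == "+" else -1
        counts[(t.chrom, pos, t.strand)] += 1
    return counts


tags = [
    Tag("chr1", "+", 100, "G", "A"),  # extra G: true 5' end is 101
    Tag("chr1", "+", 101, "A", "A"),
    Tag("chr1", "-", 200, "G", "G"),  # G matches the genome: kept
]
print(ctss_counts(tags))
```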

CTSS clustering into TCs and promoter regions. CTSSs supported by at least 
1 tpm in at least one of the 12 developmental stages were clustered at two levels. 
First, TCs were created for each stage individually using a simple distance-based 
approach with a maximum allowed distance of 20 bp between two neighbouring 
CTSSs. Next, for each TC we calculated a cumulative distribution of the CAGE 
signal and determined the positions of the 10th and 90th percentile to obtain more 
robust boundaries of a TC. TCs across all developmental stages within 100 bp of 
each other were aggregated into a single promoter region. Only promoter regions 
supported by at least 5 tpm in at least one developmental stage were used in further 
analyses. 
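
The two-level clustering described above can be sketched as follows, for CTSSs on a single chromosome and strand. The function names are hypothetical, and the choice to merge promoter regions on the gap between percentile boundaries is an assumption; the text only states that TCs within 100 bp of each other were aggregated.

```python
def cluster_ctss(ctss, max_gap=20):
    """Group (position, tpm) pairs into tag clusters by neighbour distance."""
    clusters = []
    for pos, tpm in sorted(ctss):
        if clusters and pos - clusters[-1][-1][0] <= max_gap:
            clusters[-1].append((pos, tpm))
        else:
            clusters.append([(pos, tpm)])
    return clusters


def percentile_bounds(cluster, lower=0.10, upper=0.90):
    """Positions where cumulative CAGE signal first reaches the given fractions."""
    total = sum(tpm for _, tpm in cluster)
    cum, lo, hi = 0.0, None, None
    for pos, tpm in cluster:
        cum += tpm
        if lo is None and cum >= lower * total:
            lo = pos
        if hi is None and cum >= upper * total:
            hi = pos
    return lo, hi


def merge_regions(intervals, max_gap=100):
    """Merge (start, end) intervals that lie within max_gap of each other."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start - merged[-1][1] <= max_gap:
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


ctss = [(100, 0.5), (103, 4.0), (105, 5.0), (118, 0.5), (200, 5.0)]
tcs = cluster_ctss(ctss)
bounds = [percentile_bounds(tc) for tc in tcs]
print(bounds, merge_regions(bounds))
```

The tpm filters (at least 1 tpm per CTSS, at least 5 tpm per promoter region) would be applied before and after these steps respectively.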

Expression profiling. Expression profiling was done at two levels: individual CTSSs, 
and entire promoter regions. To minimize the noise from weakly supported CTSSs, 
we selected only CTSSs with at least 5 tpm in at least one developmental stage. 
Normalized tpm values across 12 developmental stages for each CTSS (or promoter 
region) were divided by their standard deviation to obtain scaled expression mea- 
sures. A self-organizing map”* (SOM) unsupervised learning algorithm was applied 
to distribute CTSSs (or promoter regions) across 5 × 5 = 25 expression profiles. 

Dinucleotide pattern analysis. To visualize dinucleotide composition patterns 
of sequences flanking TSSs, we first created an occurrence matrix (n × m; where 
n = number of sequences and m = length of sequences) for each individual dinu- 
cleotide, by placing 1 if the given dinucleotide is present at a given position or 0 if it 
is not. Values in the matrix were then smoothened: at each position in the matrix 
the weighted average dinucleotide occurrence was calculated by taking into account 
surrounding positions. Weights of the surrounding positions were assigned by 
centring a two-dimensional Gaussian kernel with bandwidth = 3 (in both dimen- 
sions) at the central position. The matrix of smoothened values (densities) was 
visualized using different shades of blue in a map-like representation. Extended 
Data Fig. 10 illustrates how the calculation and visualization was done. 
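
The density calculation can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions: the bandwidth is interpreted as the standard deviation of the Gaussian, the kernel is truncated at three bandwidths, and weights are renormalized at the matrix edges so that each value remains a weighted average. None of these details are specified in the text, and the function names are hypothetical.

```python
import math


def occurrence_matrix(seqs, dinuc):
    """Binary matrix: 1.0 where `dinuc` starts at that position, else 0.0.

    A sequence of length L yields L - 1 dinucleotide positions.
    """
    width = len(seqs[0]) - 1
    return [[1.0 if s[j:j + 2].upper() == dinuc else 0.0 for j in range(width)]
            for s in seqs]


def dinucleotide_density(seqs, dinuc, bandwidth=3.0, truncate=3.0):
    """Smooth the occurrence matrix with a two-dimensional Gaussian kernel."""
    occ = occurrence_matrix(seqs, dinuc)
    n, m = len(occ), len(occ[0])
    radius = int(truncate * bandwidth)
    w = {d: math.exp(-0.5 * (d / bandwidth) ** 2)
         for d in range(-radius, radius + 1)}
    dens = [[0.0] * m for _ in range(n)]
    for i in range(n):
        for j in range(m):
            num = den = 0.0
            for di, wi in w.items():
                for dj, wj in w.items():
                    r, c = i + di, j + dj
                    if 0 <= r < n and 0 <= c < m:
                        num += wi * wj * occ[r][c]
                        den += wi * wj
            dens[i][j] = num / den  # weighted average over the neighbourhood
    return dens


density = dinucleotide_density(["TATATA", "GGGGGG"], "TA")
print([round(v, 3) for v in density[0]])
```

Because smoothing runs across rows as well as columns, the order in which sequences are sorted (for example by TSS shift distance) affects the resulting map.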

TATA-box motif analysis. A TATA-box PWM was obtained from the JASPAR 
database*’ (http://jaspar.genereg.net/) and used to scan the region —35 bp to —22 bp 
upstream of the TSS (expected position for a TATA-box was determined as previ- 
ously described'*). For each promoter sequence a maximal detected match (%) to 
the TATA-box PWM was reported for the maternal and zygotic dominant TSS 
separately. Distribution of obtained values across all promoters was visualized by 
histograms. In addition, the frequency of the top ten most abundant pentamers 
found in the scanned sequences was shown. De novo motif discovery was performed 
on a set of 14 bp long sequences spanning the region from −35 bp to −22 bp 
upstream of the TSS using MEME” (http://meme.sdsc.edu/) with default 
parameters. Only motifs with an E value ≤ 0.01 were selected as significant. 
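
The PWM scan can be illustrated with a toy example. The matrix below is a made-up four-column matrix, not the JASPAR TATA-box PWM, and the definition of match (%) as the score rescaled between the lowest and highest attainable PWM scores is an assumption; other conventions (for example, the score as a fraction of the maximum score) are also in use.

```python
def pwm_match_percent(window, pwm):
    """Score of `window` rescaled to 0-100 between the worst and best PWM scores."""
    score = sum(col[base] for col, base in zip(pwm, window))
    best = sum(max(col.values()) for col in pwm)
    worst = sum(min(col.values()) for col in pwm)
    return 100.0 * (score - worst) / (best - worst)


def max_match(seq, pwm):
    """Maximal match (%) over all windows of the scanned sequence."""
    width = len(pwm)
    return max(pwm_match_percent(seq[i:i + width], pwm)
               for i in range(len(seq) - width + 1))


# Hypothetical toy matrix favouring T-A-T-A; real PWMs hold log-odds weights.
toy_pwm = [
    {"A": 0.0, "C": -1.0, "G": -1.0, "T": 1.0},
    {"A": 1.0, "C": -1.0, "G": -1.0, "T": 0.0},
    {"A": 0.0, "C": -1.0, "G": -1.0, "T": 1.0},
    {"A": 1.0, "C": -1.0, "G": -1.0, "T": 0.0},
]

print(max_match("GGTATAGG", toy_pwm), max_match("GGGGGG", toy_pwm))
```

Applied to the region upstream of the maternal and zygotic dominant TSS of each promoter, the per-promoter maxima would give the two distributions compared in the histograms.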

sf3a2 promoter reporter constructs. The region spanning 500 bp upstream and 
200 bp downstream of the dominant zygotic TSS in the sf3a2 promoter was chosen 
for validation of the TSS selection grammar. The sf3a2 promoter carries both mater- 
nal and zygotic promoter determinants and exhibits TSS shifting. The selected 
sequence, which ends within the first intron of sf3a2, was fused to a sequence 
containing the 3’ end of the zebrafish txnipa first intron and splice acceptor fused 
to an mCherry reporter (Extended Data Fig. 6a). Genomic DNA of AB* zebrafish 
strain was used for PCR amplification using Advantage HD DNA Polymerase Mix 
(Clontech). The amplified fragments were cloned into a pDB896 vector (provided 
by D. Balciunas and subsequently modified by replacing the y-crystallin:gfp with 
mCherry) using In-Fusion PCR Cloning System (Clontech) following the manu- 
facturer’s instructions. The expression cassette is flanked by Tol2 transposon arms 
for Tol2 transgenesis”. A polymorphic nucleotide (G>T) was identified in the 
promoter sequence at chr2:58,656,711 (Ensembl, Zebrafish Assembly Zv9/danRer7). 
The wild-type promoter reporter vector was used as a template for site-directed 
mutagenesis PCR to introduce the mutation in the W-boxes as indicated in Fig. 3b. 
The resulting mutated PCR fragment was cloned in the same reporter vector using 
the In-Fusion PCR Cloning System. Sequences of all primers are provided in Extended 
Data Fig. 6b. 


Microinjection and transgene expression analysis. 1.5-2 nl of injection solution 
containing 20 ng µl⁻¹ reporter plasmid DNA and 15 ng µl⁻¹ Tol2 transposase 
mRNA, supplemented with 0.1% Phenol red (injection marker), was injected into 
zebrafish eggs within 10-15 min after fertilization. In the automated imaging and 
expression analysis experiments, ecfp mRNA was added (30 ng µl⁻¹). mCherry 
reporter activity was measured in the prim 20 stage, in both embryos injected with 
wild-type and mutated sf3a2 promoter constructs by automated imaging as described 
previously*’. Embryo images were analysed with Zebrafish Miner software*’. The 
level of reporter expression was measured as pixel intensity value and normalized 
to the intensity of the ECFP signal (injection control) and averaged for all embryos 
in the experiment. In addition a percentage of expressing embryos was calculated 
from the total number of ECFP-positive embryos with an ECFP signal equal or 
above the detection threshold of the Zebrafish Miner software. Embryo images 
were wrapped onto reference embryo shape and overlaid by summing pixel inten- 
sity values. 

5’ RACE. 5’ RACE was performed with a FirstChoice RLM-RACE Kit (Life Tech- 
nologies) following the manufacturer’s protocol. Total RNA was isolated in the prim 
6 stage from ~100 phenotypically normal looking and reporter-gene-expressing 
zebrafish embryos injected with either the wild-type or mutated sf3a2 promoter 
reporter construct, using TRIZOL (Life Technologies) following the manufac- 
turer’s instructions. The PCR products of the expected size from the nested (inner) 
PCR reaction were purified from agarose gel and sequenced to identify the TSS. To 
demonstrate that the generated 5’ RACE products are specific to the 5’ ends of 
de-capped RNA, a ‘minus tobacco acid pyrophosphatase’ (TAP)-treated sample 
was put through adaptor ligation, reverse transcription and PCR. Sequences of 
primers used in 5’ RACE are provided in Extended Data Fig. 6f. 

Transgenic zebrafish lines. Transgenic zebrafish lines with the sf3a2 promoter 
(wild-type and mutated) reporter constructs were generated by microinjection of 
the corresponding construct into zebrafish zygotes as described earlier. The reporter- 
positive (mCherry) embryos were grown to adulthood and germline-transmitting 
female individuals were identified by crossing to wild-type zebrafish and selecting 
for the presence of reporter-expressing offspring. Transgene expression and TSS 
usage was analysed in F1 generation embryos. Experiments were carried out under 
licence by the Home Office Licence Number 40/3681 and PPL 40/3131. 

Quantification of the reporter mRNA levels by qPCR. RNA from reporter-expressing 
embryos in the high/sphere stage was isolated using GenElute Total 
RNA extraction kit (Sigma-Aldrich), following the manufacturer’s instructions. 
qPCR was performed using the SYBR Green detection method on a 7900HT Fast 
Real-Time PCR System (Applied Biosystems). Two primer pairs were used for both 
the mCherry reporter and the endogenous sf3a2 gene (normalization control). Technical 
triplicates were run for each primer pair. The Ct values were determined by 
the SDS v.2.4 software (Applied Biosystems), using a manual threshold of 0.2 and 
automatic baseline. Expression levels of the transgene were calculated relative to 
the endogenous sf3a2 in the same sample, using the average Ct values across technical 
triplicates and both primer pairs. The sequences of the primers used in qPCR 
are provided in Extended Data Fig. 7c. 
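
The relative-expression step can be illustrated with a short Python sketch. It assumes the standard delta-Ct form with perfect doubling per cycle (amplification factor 2), which the text does not state; the function name and the Ct values below are hypothetical.

```python
from statistics import mean


def relative_expression(ct_target, ct_reference, amplification=2.0):
    """Abundance of the target relative to the reference gene.

    Each argument holds all Ct values for that gene in one sample
    (technical triplicates from both primer pairs, six values in total).
    `amplification` = 2.0 assumes the product doubles every cycle.
    """
    delta_ct = mean(ct_target) - mean(ct_reference)
    return amplification ** (-delta_ct)


# Hypothetical Ct values: the reporter crosses the threshold two cycles
# later than endogenous sf3a2, so it is about four-fold less abundant.
mcherry_ct = [24.0, 24.0, 24.0, 24.0, 24.0, 24.0]
sf3a2_ct = [22.0, 22.0, 22.0, 22.0, 22.0, 22.0]
print(relative_expression(mcherry_ct, sf3a2_ct))
```

Because the reference gene is measured in the same sample, differences in RNA input cancel in the subtraction.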

SL-CAGE. We introduced a novel method for quantitative high-resolution detec- 
tion of TSSs and their usage within a targeted promoter, called single-locus deep 
CAGE. The method combines generation of 5’ complete cDNAs transcribed from 
capped mRNAs as described in the CAGE protocol® with the amplification of 
targeted cDNAs using gene-specific primers and subsequent high-throughput 
paired-end sequencing of the single locus (typically single promoter region) based 
library. Supplementary Table 2 describes the main steps of the protocol and pro- 
vides sequences of all primers used in different steps. Paired-end sequenced reads 
(34 bp + 35 bp) were mapped to either the spliced sequence of the sf3a2:mCherry 
transgene or endogenous sf3a2, and TSS usage was reconstructed as described earlier 
for CAGE tags. 

ChIP. ChIP experiments were carried out using the ChIP-IT Express Enzymatic 
kit (Active Motif) in line with the manufacturer’s instructions. Chromatin was 
prepared using ~5,000 and ~3,000 embryos in the 512-cell and oblong stages, 
respectively. Embryos were dechorionated enzymatically using pronase and fixed 
in 1.85% formaldehyde in Hanks media for 20 min at room temperature (23 °C). 
The embryos were washed once with PBS and the fixation was stopped by incubating 
in 1× glycine for 10 min at room temperature followed by three washes with 
ice-cold PBS. Embryos were resuspended in 1 ml ice-cold lysis buffer, incubated on 
ice for 20 min, transferred to a pre-cooled dounce homogenizer and dounced by 10 
strokes. Nuclei were collected by centrifugation, resuspended in 200 µl digestion 
buffer and incubated at 37 °C for 5 min. Chromatin was sheared by adding 10 µl 
of enzymatic shearing cocktail working stock (200 U ml⁻¹) and incubating for 
10 min at 37 °C. Shearing efficiency was checked by gel electrophoresis according 
to the manufacturer’s instructions. The reaction was stopped by adding 5 µl ice-cold 
0.5 M EDTA and incubating on ice for 10 min, and sheared chromatin was cleared 
by centrifugation. For ChIP reactions, 70 µl of sheared chromatin were mixed with 
25 µl Protein G magnetic beads, 20 µl ChIP buffer 1, 1 µl protease inhibitor cocktail, 
4 µg of anti-H3K4me3 (Abcam ab8580) or anti-H2A.Z (Abcam ab4174) antibody 
or an equivalent volume of water (no antibody control), respectively, and water to 
a final volume of 200 µl. ChIP was performed in duplicate for each stage. ChIP 
reactions were then incubated overnight at 4 °C while rotating. Magnetic beads 
were washed and incubated in elution buffer. After addition of reverse crosslinking 
buffer, samples were decrosslinked for 4 h at 65 °C. Samples were treated with 
Proteinase K and RNase A and purified using phenol-chloroform extraction. 

Tbp knockdown. One-cell-stage embryos were injected with either β-actin:yfp 
(1.7 nl, 42 pg nl⁻¹) or 1.3 kb upstream of the ntl TSS ntl:yfp (1.7 nl, 68 pg nl⁻¹) 
constructs. The injected embryos were then split into four groups. One was kept as 
a non-injected control and the other three groups were further injected with one of 
the two tbp-targeting morpholinos (1.7 nl, 2.5 mM) or with a mismatch morpholino 
described previously. All embryos were kept in E3 medium at 28.5 °C until 
the non-injected group reached the 30% epiboly stage (4.7 hpf) and were then 
analysed under a fluorescence stereoscope (Nikon SMZ1500). Arrest of epiboly 
movements, loss of β-actin:yfp (Tbp-dependent) and retention of ntl:yfp 
(Tbp-independent) reporter activities were used as markers for assessing and sorting 
Tbp morphants for ChIP analysis. Approximately 1,500 non-injected embryos, 1,500 
mismatch-morpholino-injected embryos, 1,200 tbp mo1 morphants and 1,000 tbp mo2 
morphants were used for ChIP, as described earlier. We used a previously published 
set of genes downregulated in zebrafish Tbp morphants. Genes with log fold 
change ≤ −1.5 were selected as Tbp dependent and were aligned with respect to 
the dominant TSS of the nearest promoter detected by CAGE for H3K4me3 ChIP-seq 
signal visualization. 

Deep sequencing of chromatin DNA. ChIP-seq was performed as described 
before. In brief, 10 ng of ChIP DNA was end-repaired, ligated to single-read 
adaptors, size selected and amplified for 18 cycles according to Illumina’s ChIP-seq 
protocol. Cluster generation was performed according to the Illumina Cluster 
Reagents preparation protocol (http://www.illumina.com/). Samples were sequenced 
for 36 bp or 56 bp (tbp morphants and controls) on the HiSeq 2000 system. 

ChIP-seq data analysis. Sequenced reads were mapped to the reference zebrafish 
genome (Zv9/danRer7 assembly) using Bowtie with default parameters, allowing 
up to two mismatches and keeping only uniquely mapped reads. Coverage was 
calculated for plus and minus strands separately using unextended reads and taking 
a maximum of 20 reads mapping to exactly the same position. Minus-strand coverage 
was subtracted from plus-strand coverage to obtain subtracted coverage, which 
was used for visualization and nucleosome midpoint estimation. Significantly 
enriched regions (peaks) were detected using model-based analysis for ChIP-seq 
(MACS) (http://liulab.dfci.harvard.edu/MACS/) with default parameters. Midpoints 
of nucleosomes within significantly enriched regions (FDR ≤ 0.01) were 
estimated from subtracted coverage and the nearest CAGE signal was used to determine 
strand specificity and relative position of the first downstream nucleosome. 
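
A minimal Python sketch of the subtracted-coverage calculation follows. It is an illustration rather than the pipeline used here: reads are given as hypothetical (start, end, strand) tuples with 0-based, end-exclusive coordinates, and "the same position" is interpreted as identical start, end and strand.

```python
from collections import Counter


def subtracted_coverage(reads, length, max_dup=20):
    """Plus-strand minus minus-strand coverage along a region of `length` bp.

    reads: iterable of (start, end, strand) tuples, strand being "+" or "-".
    At most `max_dup` identical reads are counted, mirroring the cap of
    20 reads per position described in the text.
    """
    seen = Counter()
    coverage = [0] * length
    for start, end, strand in reads:
        key = (start, end, strand)
        seen[key] += 1
        if seen[key] > max_dup:
            continue  # ignore duplicates beyond the cap
        sign = 1 if strand == "+" else -1
        for pos in range(max(start, 0), min(end, length)):
            coverage[pos] += sign
    return coverage


# 25 identical plus-strand reads (capped at 20) and 2 minus-strand reads.
reads = [(0, 3, "+")] * 25 + [(2, 5, "-")] * 2
print(subtracted_coverage(reads, 6))
```

Keeping the reads unextended preserves the offset between plus- and minus-strand signal on either side of a nucleosome, which is what the subtraction exploits.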

Data accession. Processed data are available for download at 
http://promshift.genereg.net/zebrafish/. Tracks can be visualized as annotated 
custom tracks in the UCSC Genome Browser using the following URLs: 
http://promshift.genereg.net/zebrafish/CAGE_and_nucleosome_tracks.txt and 
http://promshift.genereg.net/zebrafish/Transgenic_lines_sf3a2_SL-CAGE_tracks.txt. 

Extended Data Figure 1 | Promoters contain TSSs with very heterogeneous 
developmental expression profiles. a, Developmental expression profiles of 
CAGE-defined promoters obtained by self-organizing map (SOM) clustering. 
Each box represents one SOM cluster, with a series of beanplots showing 
distribution of scaled expression (logarithm of normalized number of CAGE 
tags per million) at different time points for all promoters belonging to that 
cluster (number of promoters denoted above the box). Different classes of [...] 
b, Projection of maternal- and zygotic-specific CTSSs (from Fig. 1b) onto the 
SOM classes of promoters shown in a. c, Fraction of promoters containing 
either maternal-specific (cluster 4,4 in Fig. 1b) or zygotic-specific (cluster 0,0 in 
Fig. 1b) CTSS, or both, for different classes of promoters established in a. 
d, Example of a throughout-active promoter that contains a very heterogeneous 
population of CTSSs (colour coding of CTSSs as in Fig. 1b: maternal-specific 
shown in blue, throughout-expressed shown in green, and zygotic-specific [...] 

Extended Data Figure 2 | Definition and properties of shifting promoters. 
a, Schematic representation of shifting promoter score calculation between two 
stages. Forward and reverse scores are calculated using forward (F1, F2) and 
reverse (F1', F2') cumulative sums of CAGE tags along the promoter region, 
and the final shifting score is the maximum of the two. b, Heatmap showing the 
number of shifting promoters detected by applying the method shown in 
a (using a 0.6 score threshold) to all possible pairs of developmental stages. The 
majority of shifting was detected between the first six stages (predominantly 
maternal) and last three stages (predominantly zygotic). Promoters that 
qualified as shifting in at least four of these early versus late pairs were selected 
for a final set of 911 shifting promoters. c, Distribution of distance between 
maternal and zygotic dominant TSS (shift) for 911 shifting promoters. 

Extended Data Figure 3 | Genome-wide sequence and chromatin signature 
of maternal versus zygotic TSSs. a, Dinucleotide density for all throughout-active 
promoters sorted by orientation-sensitive distance between the 
maternal and zygotic dominant TSS. Promoters were centred at either the 
maternal (left) or the zygotic (right) dominant TSS. b, Distribution of match 
(%) to TATA-box PWM in the region −35 to −22 bp upstream of the maternal 
(blue) and zygotic (red) dominant TSS in constitutively active promoters. 
P value of the two-tailed Wilcoxon rank-sum test is shown. c, Relative 
frequency of top ten most frequent TATA-like pentamers upstream of the [...] 
TATA-box PWM is denoted in the brackets. d, H2A.Z signal in the prim 6 stage 
at the same set of promoters as in a. Subtracted coverage of reads mapping to 
the plus and minus strand is shown. e, Density of TT dinucleotide in a +100 bp 
region for the same set of promoters as in a. f, WW dinucleotide density for 
maternal- (left) and zygotic-specific (right) promoters centred at their 
dominant TSS and sorted by decreasing match to the TATA-box (white 
dashed lines). Average density of individual dinucleotides is shown below. 
g, Distribution of match (%) to TATA-box PWM in the region −35 to −22 bp 
upstream of dominant TSS for maternal- (blue) and zygotic-specific (red) 
promoters. P value of the two-tailed Wilcoxon rank-sum test is shown. 
h, Relative frequency of the top ten most abundant TATA-like pentamers [...] 
TATA-box PWM is denoted in the brackets. 

Extended Data Figure 4 | Composite multiple sharp architecture of 
maternal-specific promoters. a, Distribution of promoter interquantile width 
(q0.1-q0.9; central promoter region that contains ≥80% of CAGE tags) for 
1,944 maternal-specific promoters. b, Number of TCs within the promoter 
region for maternal broad promoters (interquantile width >20 bp) across early 
developmental stages. Bubble size reflects relative number of promoters that 
contain given number of TCs. c, Comparison of dinucleotide frequencies 
around single dominant TSS in promoter (left) and dominant TSSs from all 
TCs within the same promoter (right). d, Composite architecture of a maternal 
broad promoter showing multiple sharp TCs with their associated W-box at the 
expected ~30 bp upstream position. 

Extended Data Figure 5 | Change of promoter architecture during maternal 
to zygotic transition. a, Examples of maternal-specific, zygotic-specific and 
throughout-active promoters showing difference in architecture between 
maternal and zygotic stage. b, Number of TCs within promoter region for 
maternal-specific, zygotic-specific and constitutively active promoters. Bubble 
size reflects relative number of promoters that contain given number of TCs 
and different colours correspond to different developmental stages. 
c, Distribution of distance between neighbouring TCs within the same 
promoter region. d, Heatmap showing k-means clustering (k = 15) of 
throughout-active promoters, based on the number of TCs that they contain 
throughout 12 developmental stages. e, Representative examples for three 
selected clusters: highly expressed sharp promoter whose architecture does not 
change and retains only one TC (left); promoter in which number of TCs 
decreases from five clearly separated sharp TCs in maternal stages to one fairly 
broad TC in zygotic stages (middle); promoter that changes its architecture 
from two clearly separated sharp TCs to one broader TC after maternal to 
zygotic transition (right). 

Extended Data Figure 6 | Validation of sf3a2 promoter code in transgene 
injected embryos. a, Schematic of reporter construct; the dashed lines indicate 
the spliced out intronic sequence. SA, splice acceptor; SD, splice donor. 
b, Primers used for sf3a2 construct. c, Reporter gene expression in the wild-type 
and mutant sf3a2 construct-injected embryos (36 hpf). Bright field (BF) and red 
channel fluorescent images (mCherry) are shown. Scale bar, 250 µm. The 
panels on the right show red channel projection overlay of 24 (wild type (WT), 
top) and 31 (mutant (mut), bottom) embryos. d, Percentage of mCherry- 
expressing embryos from total ECFP-positive embryos (left) and overall 
reporter activity measured as average pixel intensity values from 24 wild-type 
and 31 mutant sf3a2 promoter-injected embryos (right). e, 5' RACE of sf3a2 
promoter construct. Fragments amplified by PCR using a forward primer 
specific for the 5' RACE adaptor and two different inner reverse primers 
specific for the mCherry reporter. The arrowheads indicate the PCR products 
of expected size. A, 5' RACE adaptor; IF, inner forward primer; IR1, inner 
reverse primer 1; IR2, inner reverse primer 2; M, DNA marker; mut, embryos 
injected with construct containing sf3a2 promoter with mutations disrupting 
W-boxes (as shown in f); TAP, tobacco acid pyrophosphatase; WT, embryos 
injected with construct containing wild-type sf3a2 promoter. f, Primers used in 
5’ RACE. g, Genomic sequence of the sf3a2 promoter with point mutations 
introduced to disrupt W-boxes (orange) marked in red. The track at the 
bottom shows TSSs detected by CAGE in wild-type zebrafish embryos in prim 6 
zygotic stage. Black arrow marks TSS confirmed by 5’ RACE in the zygotic stage 
in both the wild-type and the mutated sf3a2 promoter. 


©2014 Macmillan Publishers Limited. All rights reserved 


LETTER 


[Extended Data Figure 7, panels a-d: image content is not recoverable from the text extraction. Legible labels: WT and MUT promoter sequences on chr2 (scale, 50 bases); transgenic lines WTL1F and MUTL6F; paired tracks for endogenous sf3a2 and the transgene.] 

Panel c, qPCR primers: 

              mCherry transgene         endogenous sf3a2 
Pair 1   F    GGAACAGAGGGAAGCGTGATCT    GCGTTAGAGACCATCGACATCAA 
         R    CACCTTGAAGCGCATGAACTCC    CCCCTCATTATTGTGCAGCGTG 
Pair 2   F    CAGAGGGAAGCGTGATCTGGAC    GCGTTAGAGACCATCGACATCAATAA 
         R    CTCCATGTGCACCTTGAAGCG     AGTGTGTGCAAGATAACTCCCCT 

Panel d, one-tailed P values shown for the four dashed boxes: 0.00290827, 0.00002807, 0.00000042, 0.00508534 

Extended Data Figure 7 | Validation of sf3a2 promoter code in stable 
transgenic lines. a, sf3a2:mCherry transgene expression in wild-type (WT; top 
four panels) and mutant (MUT; bottom four panels) sf3a2 promoter transgenic 
embryos (high stage). Scale bar, 1 mm. b, Quantitative polymerase chain 
reaction (qPCR) results of transgene expression: ratio between abundance of 
sf3a2:mCherry transgene mRNA and endogenous sf3a2 mRNA. Bar height 
shows mean value across six qPCR experiments (three replicas for each of the 
two different primer pairs) and error bars denote 95% confidence interval of the 
mean. c, Sequence of primers used in qPCR. d, TSSs and their relative usage 
detected by SL-CAGE in three wild-type and four mutant sf3a2 promoter 
transgenic lines. Promoter sequence is shown at the top, with point mutations 
introduced to disrupt W-boxes highlighted in red. For each transgenic line 
two tracks are shown: TSSs used by the endogenous sf3a2 promoter 
(upper track, more intensive colour) and TSSs used by the transgene 
(lower track, lighter colour). P value (one-tailed Welch’s two sample t-test) 
of the difference between mutated and wild-type promoter variants is shown 
for four maternal TSSs associated with disrupted W-boxes (dashed boxes). 

Extended Data Figure 9 | Validation of Tbp morphant phenotypes before 
ChIP-seq analysis. Tbp morphants show epiboly defects and differential loss 
of reporter activity as previously described. Embryos in the 30% epiboly 
stage showing epiboly delay and corresponding Tbp-independent (ntl:yfp) and 
Tbp-dependent (β-actin:yfp) reporter gene activities were used for H3K4me3 
ChIP-seq analysis. Zebrafish fertilized eggs were injected with the reporter 
constructs as indicated on the right followed by injection of morpholino 
(Tbp 5 mismatch or Tbp mo1 morpholino, respectively). Fluorescence 
images on top, bright field images below. ntl, notail; yfp, yellow fluorescent 
protein. Scale bar, 400 µm. 

Extended Data Figure 10 | Dinucleotide maps: calculation and visualization 
of dinucleotide patterns. Genomic sequences (of the same length) are sorted 
and aligned into a matrix-like representation (P1-P10, positions; S1-S10, 
sequences). Marking the presence of selected dinucleotide (for example, AA) by 
1 and the absence by 0 creates an occurrence matrix. Next, a weighted average is 
calculated at each position by placing a two-dimensional Gaussian kernel at 
that position and assigning weights to surrounding positions. An example of 
calculating the value at position S4,P7 is shown. Surrounding positions are 
coloured on the basis of the weights assigned to them by the Gaussian kernel 
(bandwidth = 1 in both dimensions, and covariance = 0 between the two 
dimensions). Averaged values are mapped to different shades of blue to 
visualize the dinucleotide density across the set of input sequences. 
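
The occurrence matrix and Gaussian-weighted average described in this caption can be sketched with NumPy as below. The function names, the truncation of the kernel at three bandwidths (`radius=3`) and the renormalization of weights at the matrix edges are assumptions made for this illustration; the caption specifies only the bandwidth (1 in both dimensions) and zero covariance.

```python
import numpy as np


def occurrence_matrix(seqs, dinuc="AA"):
    """Rows are sequences, columns are positions; 1 where `dinuc` starts."""
    n = len(seqs[0])
    occ = np.zeros((len(seqs), n))
    for r, seq in enumerate(seqs):
        for c in range(n - 1):
            if seq[c:c + 2] == dinuc:
                occ[r, c] = 1.0
    return occ


def gaussian_density(occ, bandwidth=1.0, radius=3):
    """Weighted average of `occ` under a 2-D Gaussian kernel (covariance 0)."""
    offsets = np.arange(-radius, radius + 1)
    w1d = np.exp(-0.5 * (offsets / bandwidth) ** 2)
    kernel = np.outer(w1d, w1d)  # separable because the covariance is 0
    padded = np.pad(occ, radius)               # zeros outside the matrix
    inside = np.pad(np.ones_like(occ), radius)  # marks real cells
    out = np.zeros_like(occ)
    size = 2 * radius + 1
    for r in range(occ.shape[0]):
        for c in range(occ.shape[1]):
            weights = kernel * inside[r:r + size, c:c + size]
            values = padded[r:r + size, c:c + size]
            out[r, c] = (values * weights).sum() / weights.sum()
    return out


seqs = ["AACGT", "CAAGT", "CGAAT", "CGTAA"]  # toy sorted sequences
density = gaussian_density(occurrence_matrix(seqs, "AA"))
print(density.round(2))
```

The resulting values lie between 0 and 1 and can be passed to any colour map (for example a blue gradient) to draw the density map.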

Addendum: Optical-field-induced current in dielectrics 

Agustin Schiffrin, Tim Paasch-Colberg, Nicholas Karpowicz, 

Nature 493, 70-74 (2013); doi:10.1038/nature11567 


Calculations in this Letter¹ and in ref. 2 are sensitive to the dipole matrix 
elements, $Z_{vi,cj}$ (i, j = 1, 2), describing the transitions between the two 
uppermost valence (v1 and v2) and two lowest conduction (c1 and c2) 
bands (see Supplementary Information to ref. 1). They have resulted in 
good agreement with all observations of both experiments for $Z_{v1,c1}$ = 3 Å, 
$Z_{v1,v2}$ = $Z_{c1,c2}$ = 2 Å and $Z_{v2,c2}$ = 0. By mistake, Fig. 1b and Supplementary 
Fig. 4 of ref. 1 show adiabatic levels of silica for $Z_{v1,c1}$ = 1 Å ($Z_{v1,v2}$, 
$Z_{c1,c2}$ and $Z_{v2,c2}$ are correct and set as above). Here we correct this error 
in Fig. 1 by plotting the adiabatic levels of silica for the value of $Z_{v1,c1}$ = 3 Å 
that we actually used. This corrected level diagram prompts a correction 
of the qualitative picture mediated by Fig. 1b and Supplementary 
Figs 4 and 6 of ref. 1, aiming at a qualitative insight of the strong-field 
optical phenomena reported in refs 1 and 2. The corrected picture suggests 
an analogy to a phenomenon known in semiconductor physics 
for more than two decades. We summarize these findings below. The 
experimental and quantitative results presented in refs 1 and 2 stand as 
published, as do the conclusions derived directly from them. 

As we have argued in the Supplementary Information to ref. 1 for 
slowly varying fields of strengths |F| ~ 1-3 V Å⁻¹, electronic states are 
predicted to undergo Wannier-Stark localization³ within approximately 
one unit cell of the wide-gap material (silica) under scrutiny. At a field 
strength where a level anticrossing denoted by $\Delta l$ occurs in Fig. 1, Zener 
tunnelling⁴ between the localized Wannier-Stark states across $|\Delta l| - 1$ 
unit cells is expected to occur, with a rate equal to the anticrossing energy 
gap (in frequency units) $\Delta E_l/(2\pi\hbar)$. As Fig. 1 reveals, for $|\Delta l| \geq 2$, this 
rate is very small with respect to the carrier frequency, $\omega_L/2\pi$, of the 
optical field (where subscript ‘L’ is for ‘laser’). Hence, all these anticrossings 
are passed diabatically in the oscillating strong laser field. 
The system follows the diabatic terms (shown by the solid crossed arrows 
in Fig. 1) as the field increases or decreases without significant transfer 
of population between the valence band and the conduction band. This 
is because the system passes the anticrossings too quickly for Zener tunnelling 
to occur. Such an anticrossing is mostly inconsequential (almost 
‘ignored’ by the system). 

Figure 1 | Electronic transitions between the valence and conduction bands 
of silica in a strong optical field. Schematic of the adiabatic energy levels as a 
function of external electric field, for interband dipole moment $Z_{v1,c1}$ = 3 Å. 
A crystal with a thickness of 50 unit cells in the field direction is considered. 
The anticrossings with $\Delta l = 1$ and $\Delta l = 2$ are indicated. Red circles denote levels 
with the valence-band wavefunctions; blue circles denote levels with the 
conduction-band wavefunctions. Closed circles indicate filled states and open 
circles indicate empty states. The arrows indicate level anticrossings passed 
diabatically (solid crossed arrows) and adiabatically (dashed parallel arrows). 

As the corrected level diagram in Fig. 1 reveals, the gap becomes very 
large, with $\Delta E_l/\hbar \gg \omega_L$, for the ultimate level anticrossing at $|\Delta l| = 1$ 
occurring when the field amplitude approaches and exceeds the critical 
field strength $F_{crit} = \Delta_g/(ea)$ (where $\Delta_g \approx 9$ eV is the bandgap of silica, e is 
the charge on the electron and a is the lattice constant). The large anticrossing 
gap implies a large probability of Zener tunnelling between the 
valence band and conduction band states, which are Wannier-Stark- 
localized at the nearest neighbours. This anticrossing is passed predo- 
minantly adiabatically, as indicated by the dashed parallel arrows in 
Fig. 1. This leads necessarily to the lower anticrossing level gradually 
changing its wavefunction from valence band to conduction band (the 
well known exchange of quantum numbers in adiabatic anticrossings) 
while remaining fully occupied. This gradual change can be quantified 
by projecting the actual filled, adiabatic field-modified valence band 
states to zero-field conduction-band states as given by Supplementary 
equation (10) in ref. 2. This projection results in a transient population 
of virtual conduction-band electrons (the evolution of which is shown 
in figure 3d of ref. 2) and the corresponding virtual valence-band holes. 

These are virtual carriers similar to those responsible for virtual 
photoconductivity in semiconductors theoretically predicted⁵ and 
experimentally observed⁶ over two decades ago. However, the analogy 
is incomplete. In our case, the virtual carriers are produced by a non-resonant 
(mostly adiabatic), strong-field (and hence, necessarily, non-perturbative) 
excitation, in contrast with early work⁵⁻⁸, where they 
were excited via a near-resonant perturbative excitation. These virtual 
carriers make the system more polarizable in a static electric field⁵⁻⁸ 
and likewise at optical frequencies. If so, they can account for both the 
transient reflectivity shown in figure 3b of ref. 2 and the transient 
current displayed in figure 3 of ref. 1. In both cases, a strong few-cycle 
field nonlinearly enhances the strong-field polarization of the dielec- 
tric by creating virtual carriers. These respond with increased excur- 
sions to a weaker optical field, which probes the resultant enhancement 
in reflectivity and separates the charges that—after being trapped in the 
electrodes or within the insulator—subsequently induce a current through 
an external circuit in refs 2 and 1, respectively. Note that our quantitative 
quantum model¹,² fully takes into account this virtual-carrier mechanism. 

We emphasize that our original interpretation¹,² of the strong field causing 
a reversible quantum transition (that we name ‘semi-metallization’) 
from an insulating state to one with increased conductivity is valid. It allows 
one to gain a useful independent insight into highly non-equilibrium 
non-perturbative processes in strong ultrafast optical fields (see also 
ref. 9). We note that recent ab initio calculations¹⁰ for insulators in 
strong fields are interpreted in terms of the field-induced insulator-to-metal 
transition. 

In conclusion, we have provided a unified qualitative explanation for 
the optical-field-induced transient reflectivity² and current¹ in terms of 
virtual electron-hole pairs generated by a strong optical field. In sharp 
contrast with virtual photoconductivity of semiconductors⁵⁻⁸, these 
effects are induced by photons far below the bandgap and respond 
nearly adiabatically, in a highly nonlinear, nonperturbative fashion, 
to the instantaneous field, rather than its cycle-averaged intensity as in 
refs 5-8, as it approaches $F_{crit}$. This opens up the prospect of exploiting 
the new effects for advancing solid-state high-speed metrology and signal 
processing towards the petahertz frontier. 

In Fig. 1a of this Article, the highest value on the colour scale should 
be 11,772 instead of 1,772. This has been corrected in the HTML and 
PDF versions of the paper online. 

RETRACTIONS 


A clean slate 


Mistakes are part of science. But setting the record straight 
promptly and clearly can help to avoid a career blot. 


BY VIRGINIA GEWIN 


After 18 months of complex testing 
and retesting, Pamela Ronald became 
certain that she needed to retract two 
high-profile papers on disease resistance in 
rice¹,². The hardest part, says Ronald, a crop 
scientist at the University of California, Davis, 
was staying calm — she worried about the 
implications for current and past lab members 
and about others spending time replicating 
potentially faulty work. 

The papers had claimed to identify a bacterial 
protein that could activate an immune response 
in rice plants with a specific receptor. But when 
new members of her team were unable to reproduce 
the results, alarm bells started ringing. 
Shaken, they decided that the first step was to 
genotype all the laboratory strains in their collection. 
Eventually they caught a labelling error: 
two of the 12 strains thought to lack the protein 
in question actually lacked a different protein. 
And the careful backtracking unearthed yet 
another error: the test, which they had used to 
verify that this protein could trigger resistance, 
turned out to be faulty. Despite her distress, 
throughout the ordeal Ronald was straightfor- 
ward with journal editors and her colleagues 
about the likelihood of retractions. She knew 
that her scientific reputation depended on com- 
plete transparency about possible errors. “You 
just have to set aside emotions and let the scien- 
tific process pull you through,” she says. 

Worldwide, retractions are on the rise: last 
year alone, scientific journals retracted roughly 
500 papers (of more than 1 million published), 
compared with fewer than 50 per year in the 
early 2000s (see Nature 478, 26-28; 2011). 

One study, in the life sciences, suggests 
that misconduct, such as plagiarism or falsi- 
fied data, has been to blame for two-thirds of 
retractions (see Nature 490, 21; 2012). And 
behavioural ecologist Daniele Fanelli of the Uni- 
versity of Montreal in Canada, who studies the 
issue, says that at least one-quarter are the result 
of unfortunate mistakes. The rise in retractions 
could be because scientists are making more 
errors, but it could also indicate a growing cul- 
ture of coming clean on errors. And that, Fanelli 
says, is a positive trend. “We really need to think 
more about how to reward retractions that are 
correcting mistakes — find a way to make them 
a badge of honour instead of a badge of shame,” 
he says. 

Scientists often treat retractions as dirty 
secrets. The muted discourse means that the 
process is often much more confusing, frus- 
trating and embarrassing for researchers, jour- 
nal editors and universities than it needs to be. 
Many struggle with the best way to correct the 
record and with how to salvage viable data. Yet 
if a retraction is the result of an accident or honest 
error, it should not be a blot on an otherwise 
respectable publication record. Scientists and 
journal editors who have retracted papers say 
that the process can be handled productively, 
whether the errors are from contamination, 
a cell-line mix-up or statistical analyses gone 
awry. Above all, they say, transparency is key. 


SENSING A PROBLEM 

A decade ago, retractions were far from trans- 
parent. “It was not unusual to see ‘Paper is 
retracted’ [in the journal] and nothing else,” 
says Ferric Fang, a microbiologist at the Uni- 
versity of Washington in Seattle who also stud- 
ies retractions. In 2009, realizing that journals 
lacked policies, the UK-based non-profit 
organization COPE (Committee on Publica- 
tion Ethics) published guidelines on how best 
to correct the scientific record. COPE expects 
its more than 9,000 journal members to follow 
the guidelines, which recommend that retrac- 
tion statements link to the retracted article, 
be freely available and state who is retracting 
the article, among other criteria. But perhaps 
most importantly, says former COPE chair Liz 
Wager, a retraction notice should include the 
reason that the retraction was made to clearly 
distinguish misconduct from honest error. 
“The retraction has to be complete and 
honest, and clearly articulate what things are 
wrong,” says Inder Verma, a molecular biologist 
at the Salk Institute for Biological Studies 
in La Jolla, California, and editor-in-chief of 
Proceedings of the National Academy of Sciences. 


RETRACTION BREAKDOWN 

There are several ways to amend a published 
paper (see ‘Retraction guide’). A retraction is 
reserved for the most severe problems, ones 
that unravel a paper’s conclusions. It is hard 
to retract a paper and later republish the valid 
parts, says Arturo Casadevall, editor-in-chief 
of mBio and a microbiologist at the Albert 
Einstein College of Medicine in Bronx, New 
York. “Many journals won’t allow that.” 

For less serious issues, there are two other 
options. For the least egregious errors — a 
mislabelled figure, for example — a correction 
often suffices. Alternatively, the paper can be 
partially retracted if the erroneous findings 
do not invalidate the article’s stated conclu- 
sions. (COPE discourages partial retractions, 
however, saying that they “make it difficult for 
readers to determine the status of the article 
and which parts may be relied upon”.) 

Once alerted to a potential problem, the first 
step is to identify whether an error has actually 
occurred and, if so, how. The authors and jour- 
nal editors then need to decide on the appro- 
priate response. As soon as Ronald realized 
something was amiss with her original work, 
she contacted the editors of the journals that 
had published the papers. “Part of you wants to 
retract it immediately,” she says. But she knew 
that she needed to ascertain the facts first. 

Her team worked out all the experiments 
that would be needed to determine the prob- 
lem. “Everyone had slightly different concerns, 
but we all wanted to get it right,” she says. Find- 
ing the error became all-consuming; other 
projects languished. 

And Ronald went a step further. Once she was 
sure that there was a problem, she contacted col- 
leagues to highlight the issue and gave a public 
seminar to inform the international community. 
“I was alarmed that others were trying to build 
on this work when we couldn't, and I didn’t want 
to waste anyone's time,” she says. Her efforts 
won her praise both from her colleagues and 
from the Retraction Watch blog, which reports 
on scientific retractions and misconduct cases. 


RETRACTION GUIDE 


In a post about Ronald’s case, Retraction Watch 
co-founder Ivan Oransky said that the blog likes 
“being able to point out when researchers stand 
up and do the right thing, even at personal cost”. 

Deciding whether to issue a full-blown 
retraction is difficult. Reputations are on the 
line, and authors may 
not necessarily agree 
on how best to amend 
the mistake. It can 
take a lot of negotiat- 
ing with journal edi- 
tors and colleagues. 


That was the experi- 
ence of Daniel St John- 
ston, a developmental 
biologist and director 
of the Gurdon Insti- 
tute at the University 
of Cambridge, UK. 
He and his colleagues 
had published two 
papers describing 
a pathway that they 
thought was essential 
for the epithelial cells 
surrounding a developing egg to maintain their 
polarity, or orientation, under starvation condi- 
tions. However, they later discovered that they 
had damaged some of the cells when they had 
dissected the tiny ovaries of the starved fruit 
flies, resulting in ‘false clones’ that mimicked 
the appearance of the cells of interest, but that 
did not have the same mutations. 

St Johnston first wrote a new paper high- 
lighting the existence of false clones. Once that 
paper was accepted in Biology Open, he sent 
the manuscript to the two journals that had 
published the original incorrect papers, Jour- 
nal of Cell Biology and Developmental Cell, in 
a bid to find the best way to link the new find- 
ings to the original papers. The journal edi- 
tors advised him to retract the original papers 
because the main conclusions were no longer 
valid. Then things got complicated. 

One author was not happy about the prospect 
of losing the data that remained sound from 
the scientific record. But to retract the papers, 
all authors had to approve. The researchers 
and journal editors discussed various options, 
including a partial retraction, but couldn't 
reach an agreement. To break the deadlock, 
St Johnston arranged for the valid data to be 
published alongside the false-clone findings in 
Biology Open, and the retractions went ahead. 
St Johnston says that he had to figure out the best 
course as he went along: “There’s no obvious 
mechanism in place to handle the useful, valid 
content of a retracted paper.” He adds that, so 
far, he has not experienced any career backlash. 

Publishers have a range of options for amending the scientific record, depending on the circumstances. 

Action: Correction (erratum) 
Examples: Typographical errors; mislabelled figures; author/contributor list incorrect 
Paper conclusions affected: No 

Action: Expression of concern 
Examples: Investigation suggests that data are not reliable; suspicions of misconduct 
Paper conclusions affected: Unsure 

Action: Partial retraction 
Examples: Figure or table based on corrupt data; some data inappropriately analysed 
Paper conclusions affected: Yes, but not overall finding 

Action: Retraction 
Examples: Error invalidates work; clear evidence of misconduct 
Paper conclusions affected: Yes 


SURMOUNTING STIGMA 

The data support what many journal editors 
advise — that the best way to overcome the 
stigma associated with a retraction is to come 
clean with a detailed account of mistakes. A 
study published last November found that 
authors who self-report mistakes and retract 
papers accordingly will not lose out on citations 
to their previous, legitimate research. However, 
authors who fail to self-report the need for a 
retraction lose up to 12.5% of citations per year 
per paper, five years after the retraction, com- 
pared with non-retracted papers with similar 
citation patterns. “Being transparent preserves 
your reputation as someone honestly seeking 
truth,” says study author Benjamin Jones, a 
higher-education researcher at Northwestern 
University in Evanston, Illinois. 

In fact, a clear retraction may end up being 
worthy of a citation itself. For example, a 2006 
retraction detailed an author dispute over 
whether original findings of ferromagnetic 
behaviour found in a carbon-60 atom remained 
valid in the face of new measures. “I’ve found 
examples of retracted papers that continue to 
be cited because the retraction is clear, suggest- 
ing that people in the field are more discerning 
than they’ve been given credit for,” says Fang. 

In St Johnston's experience, the more com- 
mon case is that scientists bury the error 
— which, he says, harms the scientific com- 
munity by squandering everyone's time and 
resources. “I personally didn't want to do that 
because my reputation is my trademark,” he 
says. “If I own up to my mistakes, people know 
that if I get it wrong, I will tell them.” 

St Johnston knew that retracting the paper 
was the right thing to do, but he says that if he 
had been at an earlier, more precarious stage of 
his career, he might have thought twice. And 
he did have serious concerns about the impact 
it would have on his junior co-authors. “Most 
of the scientific community doesn't bother 
distinguishing between an error and something 
more sinister,” he says. 

Retraction guide, issued by: correction (erratum), paper author (ideally); expression of concern, editor; partial retraction, author or editor; retraction, author or editor. 
SOURCE: COPE; COUNCIL OF SCIENCE EDITORS 

Indeed, some data suggest that early- 
career researchers are right to be concerned. 
Jones has compared how the citation 
records of eminent and less-well-established 
authors are affected after a retraction. “If 
you are highly established, we see little effect 
if you are one of the authors on a retracted 
paper. If you are not well established, you 
will see a negative effect,” he says. 

As a then-assistant professor, Geoffrey 
Chang had such concerns when he was 
alerted to a problem with his work, and 
threw himself into discovering the error. A 
crystallographer at the University of Cali- 
fornia, San Diego, he had had a string of 
successes in the form of awards, grants and 
high-profile papers starting in 2001, when 
he was in his late 20s. 

When colleagues called his findings into 
question, Chang pored over his home- 
made computer program and eventually 
discovered that he had transposed two 
data columns. He retracted five papers, 
and republished the corrected molecular 
structures in two papers roughly a year 
after pinpointing the problem. The thor- 
ough, transparent reanalysis of the data, he 
says, corrected the scientific record, which 
helped to maintain his academic standing. 
“It was seven years ago,” he says, “and it’s 
still very much a part of me. But it made us a 
stronger, more careful lab.” Chang has since 
secured major grants, including grants 
from the US National Institutes of Health. 

Cases such as Chang’s illustrate that 
retracted papers need not doom a career, 
even a fledgling one. But navigating the 
process requires a willingness to deal with 
policies and distinctions that are not always 
clear-cut. “The greatest currency we have 
as scientists is respect — that colleagues 
respect us and our work,” says Casadevall. 
“A retraction is survivable if it’s tackled 
honestly and transparently.” 


Virginia Gewin is a freelance writer in 
Portland, Oregon. 
COLUMN 


Strapped students 


Graduate students face myriad financial challenges — 
some more obvious than others, says Susie Crowe. 


Postgraduate education is a minefield of 
financial pitfalls, from tuition fees and 
loans to long stretches without contributing 
to savings. And yet many graduate students 
neglect to plan their finances accordingly. 

The difference between starting a retirement 
plan in your early 20s versus your early 30s is 
in the hundreds of thousands of dollars. This 
realization hit me like a ton of bricks when I 
attended a seminar at my husband's workplace. 
At 28, I had long missed my opportunity to take 
full advantage of compound 
interest. Like many graduate 
students, I had spent my early 
20s focusing on day-to-day 
financial challenges rather than 
on the big picture. 

There are smaller, often overlooked 
financial traps. They 
include interest charges from living 
off credit cards; fees for late tuition 
payments; library fines; parking 
expenses; and unintentionally 
paying for your own research 
because you have lost receipts for 
fieldwork or laboratory expenses 
(see Nature 501, 579-581; 2013). 
These seem trivial and short term 
compared with larger concerns, 
but they can mean the difference 
between comfortably paying your 
bills and barely scraping by. 

Many expenses can be avoided or mitigated 
if you stay organized and remain on top of 
your finances. But in some cases, this is not 
so simple. 

The money I am paid comes from so many 
sources it makes my head spin. Most students 
hope that income from teaching-assistant 
and research-assistant posts, external awards, 
internal scholarships, one-time entrance 
scholarships, tuition waivers and reimbursements 
will outweigh tuition and other fees, 
research costs, administrative charges, union 
dues and ‘clawbacks’ (funds that must be 
returned to a university administration owing 
to accounting errors). It can be tough for even 
the most financially savvy graduate student to 
keep a handle on things. 

These days, my biggest worry is that my 
degree will drag on for longer than my current 
funding, a fate I have witnessed often 
enough to make every thesis setback seem like 
an ominous premonition. Sinking thousands 
of dollars into tuition for extra semesters is 
probably not a scenario envisioned by eager 
new students, but it is one of the more serious 
financial risks they can incur. 

Given this bleak picture, what is a financially 
conscious student to do? The tired cliché of 
the impoverished graduate student is trotted 
out like a comforting inside joke, but it does 
not have to be the case. It is possible to make 
it through postgraduate education in decent 
financial shape. 

Getting a head start in 
research was more important 
to my financial solvency than 
I initially appreciated. I got a 
job as a field assistant for an 
ornithology project early in my 
undergraduate studies, which 
gave me the skills to think up my 
own thesis project and led to a 
first-author publication in Animal 
Behaviour. This project gave my CV 
the boost it needed to get me a com- 
petitive national scholarship. 

I try to be frugal and forward-thinking. 
It is important to be 
mindful of how tax laws apply 
to you. For example, in 
Canada, full-time students 
do not pay taxes on scholarships, 
but part-time students 
do. Choose a retirement plan early, 
and pay into it whatever you can. With compound 
interest, even the smallest efforts can 
yield big returns. 
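
The compound-interest point can be checked with a few lines of arithmetic. The sketch below is illustrative only: the 300 monthly contribution, the 6% annual return, the retirement age of 65 and the starting ages of 22 and 32 are assumptions chosen for the example, not figures from this column, and the function name `future_value` is hypothetical.

```python
def future_value(monthly_contribution, annual_rate, years):
    """Value of fixed end-of-month contributions with monthly compounding."""
    months = years * 12
    monthly_rate = annual_rate / 12
    if monthly_rate == 0:
        # No growth: the balance is simply the sum of the contributions.
        return monthly_contribution * months
    # Standard future value of an ordinary annuity.
    growth = (1 + monthly_rate) ** months
    return monthly_contribution * (growth - 1) / monthly_rate


# Assumed inputs (hypothetical, for illustration only).
MONTHLY = 300.0
RATE = 0.06
RETIRE_AGE = 65

early = future_value(MONTHLY, RATE, RETIRE_AGE - 22)  # start saving at 22
late = future_value(MONTHLY, RATE, RETIRE_AGE - 32)   # start saving at 32

print(f"Start at 22: {early:,.0f}")
print(f"Start at 32: {late:,.0f}")
print(f"Cost of waiting ten years: {early - late:,.0f}")
```

The ten extra years add 36,000 in deposits (300 × 120 months); whatever remains of the gap is growth on the earlier contributions. Different rates or contribution sizes will change the figure.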

‘Work hard and work smart’ is a useful 
mantra. Efficiency buys time that can be spent 
on side jobs or business ventures to provide 
supplemental sources of income and a back- 
up plan if things go wrong. Students who stay 
focused and graduate quickly will avoid sinking 
money into extra tuition. 

Be realistic. For prospective students put 
off by the financial difficulties that they may 
need to endure, I suggest thinking critically 
about whether a PhD will benefit you. Do the 
intellectual and career benefits seem worth the 
struggle? The hard truth is that if a prospec- 
tive student has trouble defining what makes 
a PhD ‘worth it’ in the long run, then perhaps 
it is not. 


Susie Crowe is a doctoral candidate in biology 
at Carleton University in Ottawa, Canada. 
SCIENCE FICTION 


MARKET FORCES 


BY IAN STEWART 


When Andrew Gordon was crossing 
the railway lines and failed to 
notice the approaching InterCity 
express, he didn’t have time to think about 
death. But if there had been time, he would, 
as an atheist, have thought: “This is the end.” 

Clearly it wasn't. 

He found himself standing in an eerie, 
luminous landscape. A narrow path wound 
its way through banked clouds to an elabo- 
rate gateway, where a figure in gleaming 
white robes waited. He wore a halo. Short 
stubby wings sprouted from his shoulders. 

This may require a rethink. “Saint Peter?” 

“Security pass, please, sir.” 

Andrew managed a shaky “Sorry, I don’t 
have a pass.” 

“Ah. New soul in the Cloud.” The man 
cocked his head, as if listening. “Gordon, 
Andrew Donald? 15 Wysteria Way, Dorridge?” 
Andrew nodded. “We don’t always get notified, 
you see. Omniscience is all very well, but... 
too much information. Accident, was it?” 

“No idea. Never saw it.” 

Another cock of the head. “Train. 10:43 
from Wolverhampton. Running late. Hard 
luck.” 

Something here wasn't right. “You are 
Saint Peter? These are the pearly gates?” 

The man shook his head. “You're thinking 
of the previous administration, son. We don't 
do sainthood any more.” 

“But — your halo. Wings.” 

“Retro look. This month’s promotional 
initiative from Marketing. This is the Secu- 
rity Entrance, and I’m Pete the Security 
Guard. We don't let any old riff-raff in.” 

“Only true believers? I have a horrible 
feeling —” 

“Belief? No, that’s obsolete. What matters 
now is added value on your soul.” 

“I always thought souls were a mistaken 
reification of patterns of brain activity. You 
mean I really do have a soul?” 

“It’s more a case of your soul has got you.” 
Seeing Andrew’s baffled stare, the guard added, 
“You’re a Christian atheist, that’s your problem.” 

“I’m not a Christian!” 

“No, but the God you don't believe in is the 
Christian God.” 

Andrew bristled. “I don't believe in any 
gods.” He looked sheepish. “Well, I didn't.” 

“Sure. But the main one you disbelieved in 
was the one your culture tried to get you to 
believe in. That coloured your expectations for 
the afterlife that you also don't believe in. Not 
total nonsense, but... anyway, omnipotent 
or not, the Old Man adopted a new business 
model. He's now CEO, Chairman and CFO of 
Paradise Group. Holy Trinity, right? The afterlife 
is now a business, not a public service. The 
archangels have become the board of directors; 
archbishops and the Pope are relationship 
managers down in the Mundane. We had to 
let the cherubim and seraphim go, of course.” 

[Illustration caption: Paradise revamped.] 

“What about Hell?” 

“Hades Inc.? Our main competitor before 
Belle’s new strategic vision.” Seeing Andrew's 
puzzlement, he added: “Belle Z. Bubb, Hades’ 
former Director of Human Resources, now 
CEO, Chair, CFO and Director of Inferno 
Technology as well.” 

“The Devil is female?” 

“Belle’s not exactly — look, there's no glass 
floor any more.” 

“You mean glass ceiling,” Andrew said in 
reflex. 

“No, I don't,” said Pete, looking down 
through the clouds. “Where were we? Oh, 
yeah, souls. A soul isn’t a thing, Andy. It’s a 
spiritual instrument. An option on you, real- 
ized at death. Could be a call option, could 
be a put option.” 

“Sorry, I don't understand the jargon.” 

“Call option gives the right to buy at a set 
price; put option gives the right to insist that 
the other party buys.” 

“What caused the changes?” 

“Lack of regulatory oversight, lavish 
bonuses, loss of stakeholder power, loss of 
employee power, third deadly sin ...” He 
leaned closer to whisper in Andrew’s ear. 

“Actually, mate, I 
perverse incentives. Most trade now is in 
derivatives. Conduct Default Swaps, Inno- 
cence Rate Caps... not enough actual souls 
to justify the bonuses, you see. So the whole 
enterprise is built on sand, to quote the 
Chairman. A few years ago both companies 
nearly went belly-up because they’d accumulated 
a speculative bubble in CDOs.” 

“CDOs?” 

“Collateralized Death Obligations. You 
take out an option on a portfolio of souls with 
bodies as collateral, package them together, 
trade the packages. Trouble was, no one knew 
what the things were worth. Thought it didn't 
matter: they could always realize the collat- 
eral assets. But when the market went belly- 
up and dead bodies were being dumped by 
the million, their spiritual value collapsed. 
Then Belle and the Old Man decided it was 
inefficient to compete, and went for horizon- 
tal integration. Merger in all but name.” 

“What about good and evil? They surely 
didn’t merge those?” 

“Course not. Under the previous manage- 
ment structure, our primary stakeholder 
base believed that whatever the Old Man 
said was good, was good. That gave author- 
ity to their concepts of morality and ethics 
and justice. Apparently. We still follow that 
principle. Anything that the directors of Par- 
adise Group approve is good, by definition. 
Course, as part of the merger, same goes for 
Hades Inc. It does save a lot of argument.” 

“But — what about people?” 

“Not sure what you’re driving at, son.” 

“What do ordinary people gain from all 
this ... commercialization?” 

Pete looked shocked. “Gain from it? Noth- 
ing. Why should they? People aren't on the 
corporate radar. They're commodities now, 
not customers.” 

“But Pete, I’m people. What will happen 
to me?” 

The Security Guard sighed. “Well, Andy... 
now you’re dead, your soul, the contract on 
you, matures. The option position unwinds, 
and...” He cocked his head for the third time, 
pursed his lips in satisfaction. “Got that, Gabe. 
OK, Andy: the good news is, your soul is currently 
part of Paradise Group’s portfolio.” 

Andrew almost collapsed with relief. Then 
he realized he might not have the complete 
picture. “And the bad news?” 

“It’s a put option.” 


Ian Stewart, emeritus professor at the 
University of Warwick, writes popular 
science books and science fiction. 